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Abstract. We give a self-contained treatment of the theory of persistence modules indexed 
over the real line. We give new proofs of the standard results. Persistence diagrams are 
constructed using measure theory. Linear algebra lemmas are simplified using a new notation 
Cs| for calculations on quiver representations. We show that the stringent finiteness conditions 

required by traditional methods are not necessary to prove the existence and stability of the 
persistence diagram. We introduce weaker hypotheses for taming persistence modules, which 
are met in practice and are strong enough for the theory still to work. The constructions 
and proofs enabled by our framework are, we claim, cleaner and simpler. 
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Introduction 



We intend this paper to be a self-contained treatment of the theory of persistence modules 
over the real line. We give the best proofs we know of the most important results. Each 
theorem is located at the appropriate level of abstraction (we believe). 

Many authors have studied persistence modules in recent years, and many of the theorems 
presented here are not original in themselves. The originality lies in the methods we use, 
which give easier proofs and sharper results. Our main innovations are these: 

• We use measure theory to construct persistence diagrams. The existence of a diagram 
is equivalent to the existence of a certain kind of measure on rectangles in the plane. 

• We introduce 'decorated' real numbers to remove ambiguity about interval endpoints. 
Decorations are also what make the measure theory work. 

• We define several kinds of 'tameness' for a persistence module. These occur naturally 
in practice. The most restrictive of these, finite type, is what is normally seen in the 
literature. We show how to work effectively with the less restrictive hypotheses. 

• We introduce a special notation for calculations on quiver representations. This con- 
siderably simplifies the linear algebra (for instance, in proving the 'box lemma'). 

Our goal in introducing these ideas is to enable other authors to define persistence diagrams 
cleanly, in a wide variety of situations, without imposing unnecessary restrictions (such as 
assuming a function to be Morse). For instance, it will be seen in forthcoming work that the 
approach here can be used to define the levelset zigzag persistence of [5] quite broadly. 

This paper owes much to [7J (and its published journal version [5]), which established the 
existence and stability of persistence diagrams for modules whose persistence maps are of 
finite rank. In the present work we call these modules 'q-tame'. 

Traditionally, continuous persistence diagrams have been treated in one of two ways. Most 
commonly, one makes the aggressive assumption that the situation being studied has only 
finitely many 'critical values'. Alternatively, as carried out in [JJ for q-tame modules, the 
diagram is constructed using a careful limiting process through ever-finer discretisations of 
the parameter. The former strategy may be appropriate when working with real-world data, 
where every persistence module really is finite in every way, but it excludes very common 
theoretical situations. The latter strategy was devised to overcome these restrictions, but 
unfortunately the limiting arguments turn out to be quite complicated. Our new approach 
gives the best of both worlds; we are able to work with broader classes of persistence modules, 
and we can reason about their diagrams in a clean way using arguments of a finite nature. 

The other debt to [7J is the recasting of stability as a statement about interleaved persistence 
modules, the so-called 'algebraic stability theorem'. In this paper we re-recast the result as 
a statement about 1-parameter families of measures. This allows us to prove stability results 
for even quite badly behaved persistence modules. 
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Overview. The paper is organised as follows. 

In section [TJ we set up the basic properties of persistence modules. These can be defined over 
any partially ordered set; we are primarily interested in persistence modules over the real line. 
In the best case a persistence module can be expressed as a direct sum of 'interval modules', 
which can be thought of as the atomic building blocks of the theory. Not all persistence 
modules decompose in this way, so we spend much of this paper developing techniques 
that work without this assumption. These techniques depend on a thorough understanding 
of finitely-indexed persistence modules known as 'A n -quiver representations' [TH] [12]. We 
introduce a special notation for performing calculations on these quiver representations. 
This 'quiver calculus' is used throughout the paperfj] 

Section [2] addresses the question of how to define the diagram of a persistence module. This 
is easy for modules which decompose into intervals. To handle the general case, we establish 
an equivalence between diagrams and a certain kind of measure defined on rectangles in the 
plane. This means that whenever a persistence diagram is sought, it is enough to construct 
the corresponding persistence measure; and theorems about a diagram can be replaced by 
simpler-to-prove theorems about its measure. The diagram exists wherever the measure 
takes finite values. This leads to several different notions of 'tameness'. There are large 
classes of examples of naturally occurring persistence modules which are tame enough for 
their diagrams to be defined everywhere or almost everywhere. 

In order to make the measure theory work, we use real numbers that are 'decorated' with 
a superscript + or _ . We use decorations also to indicate whether a real interval is open, 
closed or half-open. For persistence modules which decompose into interval modules, these 
two uses match up perfectly. 

There is a 'snapping principle' by which our abstractly defined diagrams are seen to agree 
with the diagrams that are produced by the standard algorithms [131120], for example when 
working with finite simplicial complexes derived from real data. 

In section [3j we study interleavings. An interleaving is an approximate isomorphism between 
two persistence modules. They occur naturally in applications when the input data are 
known only up to some bounded error. After presenting the basic properties, we give a clean 
proof of the technical lemma (from [7]) that two interleaved modules can be interpolated by 
a 1-parameter family. 

Section [4] is devoted to the isometry theorem, which asserts that the interleaving distance 
between two persistence modules is equal to the bottleneck distance between their persis- 
tence diagrams. The two inequalities that comprise this result are treated separately. One 
direction is the celebrated stability theorem of [TU]. The more recent converse inequality ap- 
pears in [T5]. We formulate the stability theorem as a statement about measures and their 
diagrams. The proof of this more abstract result closely follows the original proof in [TO] . 

Our version of the isometry theorem supposes that the persistence modules are 'q-tame'. We 
consider this to be the natural realm of the theorem. We also prove a more general version 

headers who wish to adopt our notation are invited to contact us for the M^X macros. 
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of the stability theorem which allows us to compare diagrams of persistence modules with 
no assumptions on their tameness: wherever the two diagrams are defined, they must be 
close to each other. 

Given the length of this paper, the reader may wonder if the framework developed here is 
truly simpler than existing approaches. In section [5j we give examples of how to use the 
results and ideas in this paper. 

Related Work. The early history of persistence is concerned with the quantity 

r* = rank(H(X s ) - E(X t )) 

for an object X represented at two different scales s, t, and where H is homology. This 
appeared in the early 1990s in the work of Frosini [H], with different notation and under 
the name 'size function'. Independently, a few years later, Robins [17] introduced the term 
'persistent Betti numbers' for quantities of the form r^ +p , and noted their stability with 
respect to Hausdorff distance. 

The modern theory of persistence is built on three central pillars. 

• The persistence diagram, and an algorithm for computing it, were introduced by Edels- 
brunner, Letscher and Zomorodian [T3] . 

• Zomorodian and Carlsson [20] defined persistence modules, indexed by the natural 
numbers and viewed as graded modules over the polynomial ring k[t]. 

• Cohen-Steiner, Edelsbrunner and Harer [10] proved the stability theorem. 

All three papers make strong finiteness assumptions about the starting data. In [7J, the 
results are generalised to persistence modules parametrised over the real line, under the 
assumption that r* < go for s < t. The present paper extends that work, with many new 
concepts and proofs. 

We draw attention to two recent papers which share our goal of understanding continuous- 
parameter persistence modules. Lesnick [TB] gives an extensive algebraic treatment of mod- 
ules over one or more real parameters. The converse stability inequality (and hence the 
isometry theorem) appears for the first time there. Bubenik and Scott |3j develop the 
category-theoretical view of persistence modules. 

Both [16] and [3] appeared during the writing of this paper. There is a fair amount of 
overlap between the three papers, most of it reached independently, but not quite all: from 
Lesnick [16] we learned of the results of Webb [19J, which resolved a sticking-point for us. 

Multisets 

Persistence diagrams are multisets rather than sets. For our purposes, a multiset is a pair 
A = (5, m) where S is a set and 

m: S {1,2,3,...} u {oo} 
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is the multiplicity function, which tells us how many times each element of S occurs in A. 
Here are our conventions regarding multisets. 

• The cardinality of A = (S, m) is defined to be 

card A = ^ m(s) 

seS 

which takes values in {0, 1,2,... }u{oo}. We do not distinguish between different infinite 
cardinals. 

• We never form the intersection of two multisets, but we will sometimes restrict a mul- 
tiset A to a set B: 

A|b = (S n B,m\ SnB ) 
We may write this as A n B when A|b is typographically inconvenient. 

• A pair (B,m) where 

m : B -» {0,1,2,...} u {oo} 

is implicitly regarded as defining a multiset A = (S, m|s) where S = B — m _1 (0) is the 
support of m. 

• If A = (S, m) is a multiset and / : S — > B where B is a set, then the notation 

{f(a)\asA} 

is interpreted as the multiset in B with multiplicity function 

m'(6) = £ m(s) 

f-Hb) 

Except in definitions like these, we seldom refer explicitly to S. 



1. Persistence Modules 
All vector spaces are taken to be over an arbitrary field k, fixed throughout the paper. 

1.1. Persistence modules over a real parameter. A persistence module V over the real 
numbers R is defined to be an indexed family of vector spaces 

(V t \teR), 

and a doubly-indexed family of linear maps 

(vl : V s - V t | s < t) 

which satisfy the composition law 

i>* o v s = v l 

whenever r < s < t, and where v\ is the identity map on V t . 
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Remark. Equivalent ly, a persistence module is a functor from the real line (viewed as a 
category with a unique morphism s — » t whenever s < t) to the category of vector spaces. 
The uniqueness of the morphism s — > t corresponds to the fact that all possible compositions 

V 1 O V s "' 1 O • • • O V s ' 2 O V Sl 
s n -l s n -2 u si u s 

from V s to Vt are equal to each other, and in particular to v\. 



Here is the standard class of examples from applied topology Let X be a topological space 
and let / : X — > R be a function (not necessarily continuous). Consider the sublevel sets: 

X t =(X,f) t = {xeX\f(x)^t} 

The inclusion maps 

(i* : X s — X* | s < t) 
trivially satisfy the composition law 

i* o = i* 

whenever r < s < t, and is the identity on X*. Collectively this information is called the 
sublevelset filtration of (X, /) and may be denoted X su b or Xg ub . 

We obtain a persistence module by applying any functor from topological spaces to vector 
spaces. For example, let H = H p (— ; k) be the functor 'p-dimensional singular homology with 
coefficients in k'. We define a persistence module V by setting 

V t = H(X'), 

and 

vl = H(iJ) : H(X S ) - H(X') 
(the maps on homology induced by the inclusion maps). More simply: 

V = H(X sub ) 



In the applied topology literature, there are many examples (X, /) whose persistent homology 
is of interest. Very often X is a finite simplicial complex and each X* is a subcomplex. It 
follows that the vector spaces H(X*) are finite-dimensional; and as t increases there are 
finitely many 'critical values' at which the complex changes, growing by one or more new 
cells. Suppose these critical values are 

a\ < 0,2 < ■ ■ • < a n . 

Then all the information in the persistence module is contained in the finite diagram 

H(X ai ) -» H(X a2 ) — ► H(X a ") 

of finite-dimensional vector spaces and linear maps. In this situation, 

• the isomorphism type of H(X sub ) admits a compact description [T3| 120]: 

• there is a fast algorithm for computing this description [131 [2D] : 

• the description is continuous (indeed 1-Lipschitz) in / [TO] . 
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This description is the famous persistence diagram, or barcode. 



In practical applications all the persistence modules that we encounter are finite, for the 
trivial reason that a computer only stores finite data. However, there are good grounds 
for extending the results of [T5| I2TJI [TU] to a more general setting. For example, theoretical 
guarantees are commonly formulated in terms of an idealised model, such as a continuous 
space to which the data form an approximation. Finiteness becomes unnatural and difficult 
to enforce in these ideal models, but one still wants the main results to be true. 



Following [7], we say that a persistence module V is tame if 

r* = rank(w*) < go whenever s < t. 

Since the word 'tame' is overloaded with too many meanings in the persistence literature, 
we offer q-tame as an alternative (for reasons that will be explained later). 

It is shown in [7J that persistence diagrams can be constructed for q-tame persistence mod- 
ules, and that these diagrams are stable with respect to certain natural metrics. We repro- 
duce these results here, using different methods for many of the arguments. We complete 
the picture by showing that the map from tame persistence modules to persistence diagrams 
is an isometry. This isometry theorem is due independently to Lesnick [16J. 



We believe that q-tame persistence modules are the 'right' class of objects to work with. 
This is for two complementary reasons: (i) we can prove almost everything we want to prove 
about tame modules and their persistence diagrams; and (ii) they occur in practice. For 
example, a continuous function on a finite simplicial complex has tame sublevelset persistent 



homology (Theorem 2.22). See jH] for many other examples. 



1.2. Different index sets. We can define persistence modules over any partially ordered 
set T, formally in the same way as for R, by specifying indexed families 

(V t \te T) and («* | s, t e T, s < t) 

of vector spaces and linear maps, for which v\ = v\ o v s r whenever r < s < t, and where v\ is 
the identity on V t . The resulting collection of data is called a T-persistence module or a 
persistence module over T. 

If V is a T-persistence module and ScT, then we get an S-persistence module by consid- 
ering only those spaces and maps with indices in S. This is called the restriction of V to S, 
and may be written Vs- 

Most commonly, we work with locally finite subsets T c R, that is, those subsets with no 
accumulation points in R. We collect information about an R-persistence module by con- 
sidering its restriction to finite or locally finite subsets. This works well because persistence 
modules over {1,2, ... ,n} or over the integers Z are well understood. 

In section |3j we will consider some other posets. 
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1.3. Module categories. A homomorphism $ between two T-persistence modules U, V is 
a collection of linear maps 

(fa : U t - V t | te T) 

such that the diagram 



U. 



■> V 



V s ■ 

commutes for all s < t. Composition is defined in the obvious way, as are identity homo- 
morphisms. This makes the collection of persistence modules into a category. 

The category contains kernel, image, and cokernel objects for every map $, and there is a 
zero object. 

Write 

Hom(U, V) = {homomorphisms U — > V}, 
End(V) = {homomorphisms V —> V}. 
Note that End(V) is a k-algebra. 

Later we will consider homomorphisms that shift the index. We will introduce these when 
we consider the interleaving relation between persistence modules. 



1.4. Interval modules. The basic building blocks of persistence are the interval modules. 
One seeks to understand a persistence module by decomposing it into intervals. This is not 
always possible, but it is possible often enough for our purposes. 

Let T cr R ; and let J c T be an interval]^] Then I J is defined to be the T-persistence 
module with spaces 

k if t e J 

otherwise 



and maps 



1 if s, t e J 
otherwise 



In informal language, I J represents a 'feature' which 'persists' over the interval J but is 
absent elsewhere. We write 1^ when we wish to specify the index set unambiguously. 

We now establish notation for describing interval modules. 

If T is a locally finite subset of R, then any bounded interval contains its endpoints. We 
write these as closed intervals, and unbounded intervals as open or half-open intervals. For 
example, in the case of Z-persistence modules there are four kinds of interval module: 

j[m,n] jr(— oo,n] jj-[m,+oo) j(-oo,+oo) 



By this we mean that if r < s < t are elements of T with r, t e J, then s e J. 
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For visual clarity, we sometimes lower the superscripts, writing 

I[m,n] I(— oo, n] I[m, +oo) I(— go, +oo) 

instead. 

For persistence modules over R, it becomes important to distinguish intervals which have 
the same endpoints but different topology (open, closed, half-open). For this purpose we 
introduce decorated real numbers, which are written as ordinary real numbers but with 
a superscript + (plus) or ~~ (minus). For finite intervals we adopt the following dictionary: 



[P ! 


Q-) 


means 


\p,q) 




q + ) 


means 


\p,q] 


lP + : 


Q-) 


means 


(p,q) 


lP + : 


q + ) 


means 


(p,q\ 



We require p < q, except for the special case (r ,r + ], which represents the 1-point interval 
[r, r] . 

We also include the symbols — oo and +00, for infinite intervals. Since real intervals are always 
open at infinity these implicitly carry the superscripts — 00 + and +oo~, but we usually omit 
the superscripts. The interval notation extends in the obvious way, so 

(— 00, g + ] means (—00,9] 

for instance. 

When we wish to refer to a decorated real number but we don't know what the decoration 
is, we use an asterisk. Thus p* means p + or p~ . The notation for an arbitrary interval is 
[p* 1 q*}) where p* < q* in the obvious ordering on decorated reals. 

Here are some visual conventions for representing interval modules over R. 

We work in the half-plane 

IK = {(p,q) \ p^q} 

of points in R 2 which lie on or above the diagonal. Any finite interval module l[p* , q*) may 
be represented in several different ways (see Figure [I]) : 

• as an interval in the real line; 

• as rank(i*), viewed function "K -> {0, 1}; 

• as a point (p, q) in "K, with a tick to specify the decoration. 

Here are the four tick directions explicitly: 

{p-,q + ) = [p + ,q + ] = 4 

ip~,q~) = p [p + ,q~) = «s 

The convention is that the tick points into the quadrant suggested by the decorations. 
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Figure 2. The extended half-plane "K with examples of each type of interval, 
drawn as points with ticks. 

A fourth option is to draw the point (p, q) without indicating the decoration. This 'forget- 
ful' representation is the classical convention for persistence diagrams, and is adequate for 
most purposes. However, the extra precision provided by decoration is important for the 
correspondence between diagrams and measures. 

To represent an infinite interval as a (decorated or undecorated) point, we work in the 
extended half-plane 

CK = 'K u {-00} x R u R x {+00} u {(-00, +00)} 

which can be drawn schematically as a triangle. See Figure [2] 

1.5. Interval decomposition. The direct sum W = U © V of two persistence modules 
U, V is defined as follows: 

W t = U t ®V t , w*=u*0vi 
11 



This generalises immediately to arbitrary (finite or infinite) direct sums. 



A persistence module W is indecomposable if the only decompositions W = U©V are the 
trivial decompositions W © and © W. 

Direct sums play both a synthetic role and an analytic role in our theory. On the one hand, 
given an indexed family of intervals ( J<? | £ e L) we can synthesise a persistence module 

V = 0I J< 

£eL 

whose isomorphism type depends only on the multiset { Jt \ I s L). In light of the direct-sum 
decomposition, we can think of V as having an independent feature for each I e L, supported 
over the interval J(. On the other hand, we can attempt to analyse a given persistence module 
V by decomposing it into interval modules. 

We now present the necessary theory for this. A 'building block' in a module category can 
be characterised by having a comparatively simple endomorphism ring. Interval modules 
have the simplest possible: 

Proposition 1.1. Let 1 = 1^ be an interval module over T c R ; - then End(I) = k. 

Proof. Any endomorphism of I acts on each nonzero I t = k by scalar multiplication. By the 
commutation law, it is the same scalar for each t. □ 

Proposition 1.2. Interval modules are indecomposable. 

Proof. Given a decomposition I = U© V, the projection maps onto U and V are idempotents 
in the endomorphism ring of I. But the only idempotents of End(I) = k are and 1. □ 

Theorem 1.3 (Krull-Remak-Schmidt-Azumaya). Suppose that a persistence moduleY over 
T c R can be expressed as a direct sum of interval modules in two different ways: 

V = ®I J< = ® l Km 

£eL meM 

Then there is a bisection a : L —*■ M such that Ji = K a m for all t. 

Proof. This is from Azumaya [2] (Theorem 1), plus the trivial observation that I J = I 
implies J = K. The theorem requires a certain condition on the endomorphism ring of 
each possible interval module: if a, e End(I) are non-isomorphisms then a + (3 is a non- 
isomorphism. Since each End(I) = k, the only non-isomorphism is the zero map and the 
condition is satisfied. □ 

In other words, provided we can decompose a given persistence module V as a direct sum of 
interval modules, then the multiset of intervals is an isomorphism invariant of V. But when 
does such a decomposition exist? 

Theorem 1.4 (Gabriel, Auslander, Ringel-Tachikawa, Webb). LetY be a persistence module 
over T c R. In each of the following situations, Y can be decomposed as a direct sum of 
interval modules: 
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(1) T is a finite set. 

(2) T is a locally finite subset of R and each Vt is countable- dimensional. 

On the other hand, (3) there exists a persistence module over Z (indeed, over the nonpositive 
integers) which does not admit an interval decomposition. 

We are grateful to William Crawley-Boevey for clarifying the situation for us, and for intro- 
ducing us to the module with no interval decomposition. We acknowledge Lesnick [16] for 
drawing our attention to Webb [T9] . 



Proof. (1) The decomposition of a diagram 

Vi -> V 2 -> ► V n 

into interval summands, when each dim(Vi) is finite, is one of the simpler instances of 
Gabriel's theorem [15]; see [20J or |4J for a concrete explanation. The extension to infinite- 
dimensional modules follows abstractly from a theorem of Auslander [1] and, independently, 
Ringel and Tachikawa [18j . Alternatively, note that the argument given in [I] does not really 
require finite-dimensionality. 

(2) We may assume that T = Z because any locally finite subset of R is equivalent as an 
ordered set to a subset of Z. Then this is a theorem of Webb [19]. 

(3) Webb [19] gives the following example, indexed over the nonpositive integers: 

Vo = {sequences (xx,X2, %3, ■ ■ ■ ) of real numbers} 
V- n = {sequences with x± = ■ ■ ■ = x n = 0} for n ^ 1 

The maps vZm are the inclusions V- m c V_ n (m ^ n). 

Suppose V has an interval decomposition. Since each map vZ™-i is injective, all of the 
intervals must be of the form [— n, 0] or (— oo,0]. Since dim(V_ ri /Vl n _i) = 1, each interval 
[— n, 0] occurs with multiplicity 1. Since H^ 7 -™ = {0}; ^ ne interval (— oo,0] does not occur 
at all. 

This would imply that V = ® n>0 ~^~ n ' ^- This contradicts the fact that dim(Vo) is uncount- 
able. Therefore V does not admit an interval decomposition after all. □ 



For a persistence module which decomposes into intervals, the way is now clear to define 
its persistence diagram. Simply record which intervals occur in the decomposition (with 
multiplicity): see section 1.6 Theorem 1.3 tells us that this is an isomorphism invariant. 



However, we have seen that arbitrary persistence modules over R are not guaranteed an 
interval decomposition. Here are three ways around the problem: 

• Work in restricted settings to ensure that the structure of V depends only on finitely 
many index values teR. For example, if X is a compact manifold and / is a Morse 
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function, then H(X su b) is determined by the finite sequence 



H(X ai ) -» H(X a2 ) — > H(X a ") 

where a±,a2,- ■ ■ , a n are the critical values of /. This is the traditional approach. In 
this setting, the word 'tame' is often used to demarcate pairs (X, f) for which H(X sub ) 
is determined by a finite diagram of finite-dimensional vector spaces. 

Sample the persistence module V over a finite grid. Consider limits as the grid converges 
to the whole real line. This is the approach taken in [7], where it was shown that the 
q-tame hypothesis is sufficient to guarantee good limiting behaviour. 

Show that the persistence intervals (in the decomposable case) can be inferred from the 
behaviour of V on short finite index sets. Apply this indirect definition to define the 
persistence diagram in the non-decomposable case. This is the method of 'rectangle 
measures' developed in this paper. 



Although Theorem 1.4 doesn't cover it, it remains possible that there is a decomposition 
theorem for q-tame modules (in fact, it seems somewhat likely). Such a theorem would 
remove the impetus for some of the arguments in this paper. 



1.6. The persistence diagram of a decomposable module. If a persistence module V 
indexed over R can be decomposed 

V s 

then we define the decorated persistence diagram to be the multiset 

Dgm(V) = lnt(V) = {(^,g;)|£eL} 
and the undecorated persistence diagram to be the multiset 

dgm(V) = int(V) = {(p t , q t ) \ i e L) A 
where A = {(r, r) | r e R} is the diagonal in the plane. 



Remark. In section [27T| we will give a quite different definition of the persistence diagram of V, 
based on the persistence measure rather than the interval decomposition. When occasionally 
we must distinguish between the two notions, we use the alternate names Int, int rather than 
Dgm,dgm for the diagrams defined here. 



Theorem 1.3 implies that Dgm(V) and dgm(V) do not depend on the decomposition of V. 



Notice that Dgm is a multiset of decorated points in !K, whereas dgm is a multiset of undec- 
orated points in the interior of "K. Here 'interior' means that we exclude the diagonal but 
keep the points at infinity. The information retained by dgm is precisely the information we 
care about later, when we discuss bottleneck distances. See section |4j 



Let us see how these definitions play out in an example of the traditional kind. 
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Figure 3. A traditional example in persistence theory: (left) X is a smoothly 
embedded curve in the plane, and / is its y-coordinate or 'height' function; 
(right) the decorated persistence diagram of H(X su b). There are three intervals 
in H and one interval in Hi. 



Example 1.5. Consider the curve in R 2 shown in Figure [3j filtered by the height function. 
The sublevelset persistent homology decomposes into half-open intervals as follows: 

H (X sub ) s I[aT, +oo] 1(6- , c~) 1[dT, e~) 
Hi (X sub ) s +oo] 



If we let [x] denote the chain corresponding to the critical point with critical value x, then the 
three summands of Ho can be taken to be generated by [a], [b] — [a], and [d] — [a] respectively. 
For instance, the independent 0-cycle [b] — [a], which is born at time b, becomes a boundary 
at time c. Thus it gives rise to the half-open interval [b, c) = [b~, c~). 

Remark. For a Morse function on a compact manifold with criti cal values (ai), the intervals 
are always half-open, of type [a^ a^) = [a^,aj). See section 



2.9 



1.7. Quiver calculations. We now set up the notation and algebraic tools for handling 
persistence modules over a finite index set. 

A persistence module V indexed over a finite subset 

T : a\ < <i2 < • • • < a n 
of the real line can be thought of as a diagram of n vector spaces and n — 1 linear maps: 



V: V r , 



V, 



a 2 



v an 



Such a diagram is a representation of the following quiver: 



We have seen (Theorem 1.4) that V decomposes as a finite sum of interval modules I[aj, a,]. 
When n is small, we can represent these interval modules pictorially. The following example 
illustrates how. 
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Example 1.6. Let a < b < c. There are six interval modules over {a, b, c}, namely: 
I[a,a] = » a — o b — o c I[a,b] = » a — o c I[a,c] = • a —m b —m c 
b] = o a — • b — o c l[b, c] = o a — « 6 — « c 

I[C, C] = O a — O b — « c 

In the notation we use filled circles • to indicate where the module has rank 1, and clear 
circles o to indicate where the module has rank 0. The connecting maps have full rank. 



Now let V be a persistence module indexed over R. For any finite set of indices 

T : ai < ci2 < • • • < a, n 

and any interval [a^a,] c T, we define the multiplicity of [a^a,] in Vt to be the number 
of copies of I[a,, aj] to occur in the interval decomposition of Vt- This takes values in the 
set {0, 1, 2, ... , go}. (We do not distinguish different infinite cardinals.) 

It is useful to have notation for these multiplicities. Again, we define by example. 
Example 1.7. We write 

([b,c\\Y a ,b,c) or <o a — « 6 — « C |V> 
for the multiplicity of o a — » b — « c in the 3-term module 

v aAc = (K — H — v c ). 

When V is clear from the context, we may simply write 

The abbreviation ([b, c]) is not permitted since it is ambiguous. For example, ([6, c] | V& )C ) 



1.11 



and ([b,c] \ V^& )C ) are not generally the same. See Proposition 1.10 and Example 

Example 1.8. The invariants of a single linear map V a V b are: 

rank(f ) = <« a — » b \ V> 
nullity (v) = <» a — °b I V> 
conullity(f ) = (o a — » b \ V) 
(The co nullity is the dimension of the cokernel.) 

Proposition 1.9 (direct sums). Suppose a persistence module V can be written as a direct 
sum 

V= 0V^ 

Then _ 

([ai,aj] | V T > = 2<Ka,] | V T > 

for any index set T = {ai, a 2 , . . . , a n } and interval [ai, aj] c T. 

Proof. Each summand V T can be decomposed separately into interval modules. Putting 
these together we get an interval decomposition of Vt- The number of summands of a given 
type in Vt is then equal to the total number of summands of that type in all of the V T - □ 
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Often we wish to compare multiplicities of intervals in different finite restrictions of V. The 
principle is very simple: 

Proposition 1.10 (restriction principle). Let S,T be finite index sets with ScT. Then 

<I | V s > = £<J I V T > 

3 

where the sum is over those intervals J c= T which restrict over S to I. 

Proof. Take an arbitrary interval decomposition of Vp- This induces an interval decomposi- 
tion of Vs. Summands of V s of type I arise precisely from those summands of V T of types 
J as above. □ 

Example 1.11. Let a < b < c. Consider new indices p and q, arranged a<p<b<q<c. 
Then 

(°«- -*c> = (°a- *q— *c> 

whereas 

(°«- -«6- -*c> = <°a— °p— «6- -«c> + (°a— •p— *b~ -«c>- 

The extra term occurs when the inserted new index occurs between a clear node and a filled 
node, because then there are two possible intervals which restrict to the original interval. 

We will make frequent use of the restriction principle. Here is a simple illustration, to serve 
as a template for similar arguments that we will encounter later on. 

Example 1.12. Consider the standard fact that rank(V& — » V c ) ^ rank(V a — > Vd) when 
a < 6 < c ^ d. The proof using quiver notation runs as follows: 

rank(V r 6 V c ) = < •&— « c > 

= {•a — »b — * c — + three other terms 

> <»a— *c— •d) 

= (•a— -«d> 

= rank(K - V d ) 

The 'three other terms' are 

<°a •& *c «d>, <«a «6 *c °d), (°a *6 *c °d> 

as indicated by the restriction principle. 



2. Rectangle Measures 



For a decomposable R-persistence module 

we have defined the decorated persistence diagram to be the multiset 

Dgm(V) = {(pJ,<£)K6L}, 

17 



and the undecorated persistence diagram to be the multiset 

dgm(V) = {(p t ,qt) \ ieL}. 



If we don't know that V is decomposable, then we have to proceed differently. The rough 
idea is that if we know how many points of Dgm are contained in each rectangle in the upper- 
half space, then we know Dgm itself. For persistence modules, counting points in rectangles 
turns out to be easy. 

The language of measure theory is well suited to this argument. We will show that each 
persistence module defines an integer-valued measure on rectangles. If the module is tame 



then this measure is finite- valued, and therefore (Theorem 2.8) it is concentrated at a discrete 
set of points: this is the persistence diagram. 

In the decomposable case, we will see that this agrees with the definition above. And when 
the module is not known to be decomposable, we can proceed regardless. 

The persistence measures that we construct are not true measures on subsets of R 2 : they 
are additive in the sense of tiling rather than in the usual sense of disjoint set union. The 
discrepancy arises when we split a rectangle into two: what happens to the points on the 
common edge? To which rectangle do they belong? In resolving this, one is naturally led 
to the notion of decorated points. This fits perfectly with our earlier use of decorations to 
distinguish open and closed interval ends. 

2.1. The persistence measure. Let V be a persistence module. The persistence measure 
of V is the function 

Hv(R) = (o a — o d | V> 

defined on rectangles R = [a, b] x [c, d] in the plane with a < b < c < d. 

For a decomposable persistence module, there is a clear relationship between /zy and the 
interval summands of V. Let us first consider the case of an interval module. 

Proposition 2.1. Let V = I J where J = [p*,q*] is a real interval. Let R = [a,b] x [c,d] 
where a < b < c < d. Then 

1 if [b, c] c J c (a, d) 
otherwise 



Hv{R) 



Proof. It is clear that I J restricted to {a,b,c,d} is an interval or is zero. Thus, /iy(-R) ^ 1- 
Moreover /iy(-R) = 1 precisely when 

which happens if and only if b, c e J and a,d $ J. This is equivalent to the condition 
[b,c] c J c ( a ,d). □ 



Proposition 2.1 has a graphical interpretation. Represent the interval J c R as a decorated 
point in the extended plane. The following picture indicates exactly which decorated points 
[p*,q*] are detected by //y(-R): 
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< X > 



If (p, q) is in the interior of R then [p*,q*] is always detected regardless of the decoration. 
If (p, q) is on the boundary, then [p*, q*) is detected if the tick is directed inwards. 

We formalise this by defining a membership relation between decorated points and rectangles. 

Definition. Let R = [a, b] x [c, d] where a < b < c < d, and consider a decorated point 
(p*, q*) with p* < q*. We write (p*,q*) s R if 

(i) the interval J = [p*, q*) satisfies [b, c] c J c (a, (i); 
or equivalent ly 

(ii) the point (p, q) and its decoration tick are contained in the closed rectangle R. 
We use the notation 

R' = {(p*,q*)eR} 

when we wish to explicitly name the set of decorated points contained in R. However, we 
prefer to write (p*,q*) £ R instead of (p*,q*) e R* when we are simply indicating the 
relationship between a decorated point and a rectangle. 

With this understanding we state the following counting result. 
Corollary 2.2. Suppose V is a decomposable persistence module over R: 

v = ©%*,<?;] 

Then: 

(*) MR) =card(Dgm(V)| i? ) 



Proof. This follows immediately from Proposition 2.1 and Proposition 1.9 (direct sums). □ 



We can now articulate our strategy for defining the persistence diagram without assuming 
that the module V is decomposable: 

• construct the persistence measure /xy; 

• let Dgm(V) be a multiset in the half-plane such that Q holds for all rectangles R. 

To make this work, we need to know that such a multiset exists and is unique. This is the 



content of Theorem |2.8[ under the hypothesis that /xy is finite and additive. The result is a 
sort of 'Riesz' representation theorem for measures on rectangles. 
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When V is decomposable, Corollary 



2.2 



confirms that our new definition agrees with the old. 



2.2. The persistence measure (continued). We call /xy a measure because it is additive 
with respect to splitting a rectangle into two rectangles. We prove this shortly. First, we 
give a new proof of an 'alternating sum' formula for /iy(-R) that appears in j 10 . 

Proposition 2.3. Let V be a persistence module, and let a < b < c < d. If the spaces V a , 
Vb, V c , Vd are finite- dimensional, or less stringently ifr b < go, then 

<°a — — « c — °d | V> 



r ; , - 1\ . 



4 + i d a 



(Here as before r* = rank(w* : V s — > V*).; 



Proof. Decompose the 4-term module Y a ^,c,d into intervals. The left-hand side counts inter- 
vals of type [b, c]. By the restriction principle, the four terms on the right-hand side evaluate 
as follows: 



4 =(°a- 


-•b— •c— °d) + (•a— •&— •c— °d) ~ 


h <o a - 


-*6— •c— *d> + <*a- 


-•6— «c— *d> 




(•a •c— °d) 




+ <«a" 


-•b •c— •d) 


4 = 




<°a- 


-•6 »c «d> + <»a- 


-•b •c— •d) 


4 = 






<••- 


-•b •c— *d) 



These expressions are all finite: the hypothesis rg < go implies that the other three ranks 
are finite too (Example 1.12). We can legitimately take the alternating sum, whereupon all 
terms cancel except for the (o a — m b — • c — o d ). □ 



We give three proofs of additivity. The first is completely general, whereas the other two 
work under restricted settings but are illuminating in their own way. 

Proposition 2.4. /jy is additive under vertical and horizontal splitting, meaning that 
fiv{[a,b] x [c, d]) = ny([a,p] x [c,d]) + /x v (b, b] x [c,d]) 
H\{[a,b] x [c,d]) = fjLy([a,b] x [c,q]) +/i V ([a,6] x [q,d]) 

whenever a<p<b^c<q<d. 

This additivity property is illustrated by the following figure 






where the claim is that fiy(R) = fiy(S) + fiv(T) = fiv{U) + [iy(V). 
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First proof. Let a<p<b^c<q<d. Then we calculate 
fiy([a,b] x [c,d]) = <o a — « c — o d > 

= ( °a *p •& *c °d) + < °a °p 

= (°a — # p— — *c — °d> + (- —Op — m b - 
= H\[[a,p] x [c,d]) +/x v ([p,6] x [c,d]) 

for additivity with respect to a horizontal split, and 

fiy{[a,b] x [c,d]) = < o a — » c — — o d > 

= ( °a •& *c °g > + ( °a 



= fiy([a, b] x [c, q]) + fiy([a, b] x [q, d]) 
for additivity with respect to a vertical split. 



-°d) 



-°d 
-°d 



□ 



Second proof, assuming r£ < go. The alternating sum formula (Proposition 2.3) gives 



and 



c c 



r h + r„ 



+ rf ) + (rg 



r^-r„-r^ + r* + r. 



r,, + r„ 



4 + 



as required. Note that r£ < oo implies that r^,r^ < oo, so the formula is valid for all the 



rectangles involved. 



□ 



This second proof is particularly transparent when drawn geometrically in the plane: the + 
and — signs at the corners of the rectangles cancel in a pleasant way. 



Third proof, assuming V is decomposable. By Corollary |2.2 , the measure of a rectangle is 
equal to the number of interval summands whose corresponding decorated points lie in the 
rectangle. Additivity now follows from the elementary observation that a decorated point 
in R belongs to exactly one of S and T, and to exactly one of U and V. □ 



We finish this section with two further descriptions of fiy([a, b] x [c, d]). 
Proposition 2.5. We have the following formulae: 

im(t>£) n ker(vf) 



<°a- 



-°d | V) = dim 
= dim 



im(v°) n ker(u^) 
ker(i^) 



ker(u£) + im(t^) n ker(t>£ 



Proof. This is covered, for instance, in the localisation discussion in section 5.1 of [4J. The 
two formulae are obtained by localising at c, b respectively. □ 
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Proposition 2J3 expresses the measure of a rectangle as the dimension of a vector space 
constructed functorially from V. (Ostensibly there are two vector spaces, one for each 
formula, but the map v% induces a natural isomorphism between them.) The functoriality 
has its uses, but in other regards this characterisation is quite hard to use. For instance, 
additivity is not at all obvious in this formulation. 



2.3. Abstract r-measures. We now consider rectangle measures more abstractly. Persis- 
tence measures are of course our primary example, but the general formulation allows for 
many other situations. 

For ease of exposition, we initially work in the plane R 2 rather than the extended plane R 2 . 
The picture is completed in section 2.6 when we discuss the points at infinity. 

Definition. Let D be a subset of R 2 . Define 

Rect(D) = {[a, b] x [c, d] a T> \ a < b and c < d] 

(the set of closed rectangles contained in T>). A rectangle measure or r-measure on D is 
a function 

li : Rect(D) — {0, 1, 2, . . . } u {oo} 
which is additive under vertical and horizontal splitting (as in Proposition 2.4). 

Proposition 2.6. Let fi be an r-measure on D c R 2 . Then fi is: 

(Finitely additive) If Re Rect(D) can be written as a union R = R\ u • • • u of rectangles 
with disjoint interiors, then n{R) = fi(Ri) + • • • + /i(Rk)- 

(Monotone) If R c S then fi(R) «S fi(S). 



Proof. (Finitely additive) Let R = [a,b] x [c,d]. By induction and the vertical splitting 
property, it follows that finite additivity holds for decompositions of the form 

R = \jR l 

i 

where Ri = [a«, Oi+i] x [c, d] with a = a% < a,2 < ■ • • < a m = b. 

By induction and the horizontal splitting property, it then follows that finite additivity holds 
for 'product' decompositions 

R = [a, b] x [c, d] = [^J R^ 

where Rij = [at, a i+1 ] x [cj, Cj+i] with a = a x < a 2 < ■ ■ ■ < a m = b and c = c\ < c 2 < • • • < 
c n d. 

For an arbitrary decomposition R = Ri u • • • uRk, the result follows by considering a product 
decomposition of R by which each Ri is itself product-decomposed. 

(Monotone) Decompose S into a collection of interior disjoint rectangles R,R\,..., Rk-i, one 
of which is R. (This can be done with at most 9 rectangles using a product decomposition.) 
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Then 

fi(S) = n(R) + + • • • + n(Rk-i) 

by finite additivity and the fact that // ^ 0. □ 

Here is one more plausible-and-true statement about abstract r-measures. 

Proposition 2.7 (Subadditivity). Let p be an r-measure on T> c R 2 . If a rectangle R e 
Rect(D) is contained in a finite union 

c R lKJ ■ ■ ■ u R k 

of rectangles Ri e Rect(2)) 7 then 

fi{R) ^fi(R 1 ) + --- + fi(R k ). 

Proof. Let 

cli < a 2 < ■ ■ ■ < a m 
include all the ^-coordinates of the corners of all the rectangles, and let 

Ci < c 2 < • • • < c n 

include all the y-coordinates. Each rectangle is then tiled as a union of pieces 

[ai,a i+1 ] x [9,9+1] 

with disjoint interiors, and the measure of the rectangle is the sum of the measures of its 
tiles, by additivity. Since each tile belonging to R must also belong to one or more of the Ri, 
the inequality follows. □ 

2.4. Equivalence of measures and diagrams. We wish to establish a correspondence 
between r-measures and decorated diagrams. The task of defining a continuous persistence 
diagram can then be replaced by the simpler task of defining an r-measure. For this to work, 
the measure has to be finite. 

A useful notion is the r-interior of a region T> c R 2 , defined 

V = {(p*,Q*)\ there exists R e Rect(D) such that (p*, q*) £ R} ■ 

This is the set of decorated points that are 'seen' by the rectangles in T>. The decorated 
diagram will be a multiset in D a . Clearly an r-measure cannot tell us what happens out- 
side D". 

The interior of T> in the classical sense is written D°. It may be defined 

D° = {(p, q) I there exists R e Rect(D) such that (p, q) 6 R°} , 

supposing that we already agree that the interior of a closed rectangle R = [a, b] x [c, d] is 
the open rectangle R° = (a, b) x (c, d). The undecorated diagram will be a multiset in D°. 

Theorem 2.8 (The equivalence theorem). Let T> c R 2 . There is a bijective correspondence 
between: 
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• Finite r-measures \x on D. Here 1 finite' means that fi(R) < go for every R e Rect(D). 

• Locally finite multisets A in D u . Here 'locally finite' means that card(A|^) < go for 
every R e Rect(D). 

The measure \i corresponding to a multiset A is related to it by the formula 
(2.9) ii{R) = card(A| fi ) 

for every R e Rect(D). 

Remark. We can write equation ( 2.9[ ) equivalently as 



(2.10) (i(R)= J] m(p*,q*), 

(p*,q*)eR 

where 

m : D" -> {0,1,2,...} 

is the multiplicity function for A. 

The theorem leads immediately to the following definitions. Let /i be a finite r-measure on 
a region D a R 2 . 

(i) The decorated diagram of fi is the unique locally finite multiset Dgm(/i) in D m such 
that 

fi(R) = card(Dgm(/i)| fl ) 

for every R e Rect(D). 

(ii) The undecorated diagram of \i is the locally finite multiset in T>° 

dgm(/i) = {(p,q) | (p*,q*) e Dgm( M )} n D° 
obtained by forgetting the decorations on the points and restricting to the interior. 

Remark. Note that dgm is locally finite in D°, but not necessarily locally finite in R 2 — it 
may have accumulation points on the boundary of D. 

Proof of Theorem \2.8[ One direction of the correspondence is easy. If A is a multiset on T>' 



then the function /i(-R) on rectangles defined by equation (2.9) is indeed an r-measure. It is 
finite if A is locally finite. To verify additivity, suppose that a rectangle R is split vertically 
or horizontally into two rectangles i?i,i?2- Notice that every decorated point (p*,q*) £ R 
belongs to exactly one of Ri,R 2 . It follows that 

fi(R) = card(A|/j) = card(A|/j 1 ) + card(A|ij 2 ) = fi(R\) + /i(-??2), 

as required. 

The reverse direction takes more work. Given an r-measure [i we will (1) construct a multiset 
A in D", (2) show that /i and A are related by equation (2.9), and (3) show that A is unique. 



In practice we will work with the multiplicity function m and equation (2.10), rather than 
referring to A directly. 
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Step 1. Let \l be a finite r-measure on T>. For (p*,q*) in T> a , define 

(2.11) m(jo*,g*) = mm{fx(R) \ Re Rect(D), (p*,q*) e R} . 

Note that the minimum is attained because the set is nonempty and \x takes values in the 
natural numbers. 

Here is an alternative characterisation. Rather than minimising over all rectangles, we can 
take the limit through a decreasing sequence of rectangles: 

Lemma 2.12. Let (&) and (rji) be non-increasing sequences of positive real numbers which 
tend to zero as i — > go. Then 



and similarly 



m(p + ,q + ) = lim fi(\p,p + &] x [q,q + rji]), 

i—yQO 

m(p + , q~) = lim fi(\p,p + &] x [q ~ Vh <?]), 

J— >0O 

m(p~,g + ) = lim/i([p-&,p] x [g,g + r/i]), 
m(p",g~) = lim//([p-^,p] x 

Proof. The key observation is that the sequence of rectangles Ri = [p,p + £j x [g, g + 77J 
is cofinal in the set of rectangles R containing (p + ,q + )- In other words, for any such R we 
have Ri R for all sufficiently large i. 

By monotonicity, the sequence of nonnegative integers fi(Ri) is non-increasing, and hence 
eventually stabilises to a limit. Then 

m(p + ,q + ) < min fi(Ri) = lim fi(Ri) < M-R) 

i i— >oo 

for any R containing (p + , q + ). Taking the minimum over all R, the right-hand side becomes 
m(p + , q + ) and hence by squeezing 

m(p + ,q + ) = lim 

i— >oo 

The other three cases of the lemma are similar. □ 
We return to the main proof. 

Step 2. Having defined m(p*, q*), we now show that this is the 'correct' definition, meaning 



that equation (2.10) is satisfied. We have seen already that m corresponds to an r-measure 



(2.13) u(R)= 2j m (?W), 

(p*,q*)eR 

and it remains to show (for this step) that v = fi. We prove this by induction on k = fi(R). 

Base case. fi(R) = 0. Then for every (p*, q*) e R we have 

s; m(p*,g*) < fi(R) = 

so u(R) = 0. 
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Inductive step. Suppose fi(R) = v(R) for every rectangle R with fi(R) < k. Consider a 
rectangle R with /x(-Rq) = k. We must show that v(Rq) = k. 



Split the rectangle into four equal quadrants Si, S 2 , S 3 , £4. Certainly 

fi(R ) = ^Sj + n(S 2 ) + n(S 3 ) + /i(5 4 ) 
v(Ro) = u(Si) + u(S 2 ) + u(S 3 ) + u(S 4 ) 



by finite additivity (Proposition 2.6[ ). If every quadrant satisfies fi(Si) < k, then by induction 
we deduce that /x(-Ro) = v(R ). Otherwise, one of the quadrants has // = k and the other 
three quadrants satisfy \i = (and hence v = 0). Let R\ be the distinguished quadrant, so 
fi(Ri) = k. It is now enough to show that v{R\) = k. 

We repeat the argument. Subdivide Ri into four equal quadrants. Either all four quadrants 
satisfy the inductive hypothesis /i < k, in which case we are done. Otherwise we find a 
quadrant R4+1 with = k, and we are reduced to showing that v{Ri + \) = k. 

In the worst case — the remaining unresolved case — this iteration never terminates and we 
obtain a sequence of closed rectangles 

Rq 3 Ri 3 i?2 3 . . . 

each being a quadrant of the previous one, with fi(Ri) = k. Since the diameters of the 
rectangles tend to zero, their intersection ("Y Ri contains a single point (r, s). 



We are now in a position to show that u(Rq) 
decorated points in R . 



k, by evaluating the sum explicitly over all 



First of all, consider decorated points that eventually leave the sequence (Ri). Specifically, 
suppose that (p*,q*) e -Ro but (p*,q*) e Ri-i — Ri for some i. This means that (p*,q*) 
belongs to one of the three quadrants of Ri-i for which \x = 0. It follows immediately that 
m(p*, q*) = 0. 

Thus, the only contribution to p(Rq) comes from decorated points (p*,q*) which belong to 
every rectangle in the sequence (Ri). Clearly these must be decorated versions (r*, s*) of the 
intersection point (r, s). There are 4, 2 or 1 of them depending on how the nested sequence 
of rectangles converges to its limit. Here we illustrate the three cases: 




Suppose first that (r, s) lies in the interior of every rectangle Ri, so that all four decorated 
points (r + ,s + ), (r + ,s~), (r~,s + ), (r~,s~) belong to every Ri. Divide each Ri into 4 sub- 
rectangles Rt , Ri + , R\ , which share a common corner at (r, s) so that each of the 
four decorated points (r*,s*) belongs to one of the subrectangles in the obvious notation. 
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By Lemma 2.12 



m(r + ,s + ) = lim p,(Rl + ) m(r + ,s ) = lim fi(Rf ) 

m(r~,s + ) = lim m(r~,s~) = lim n{R\ ) 

and moreover each of these decreasing integer sequences eventually stabilises at its limiting 
value. Thus, for sufficiently large i, 

v{Rq) = m(r + , s + ) + m(r + , s~) + m(r~, s + ) + m(r~, s _ ) 

= fiW 
= k 

as required. 

A similar argument (with fewer terms) can be made in the cases where only 2 or 1 of the 
decorated points (r*, s*) belong to every Ri. For instance, if (r, s) lies on the interior of the 
right-hand edge of the rectangles (Ri) for all sufficiently large i, we split each rectangle into 
two parts Rj + and R t and obtain 

u(Rq) = m(r~, s + ) + m(r-,s~) = fi(R^ + ) + ^(R~~) = n(Ri) = k 

in the same way. In this case (r + ,s + ) and (r + ,s~) eventually leave (or were never in) the 
sequence (Ri) and therefore do not contribute to v(Rq). We omit the details of the remaining 
cases, which are equally straightforward. 

This completes the inductive step. Thus fi(R) = v(R) for every R e Rect(D). 

Step 3. Suppose m'(p*,q*) is some other multiplicity function on D m whose associated 
r-measure 

v'(R)= J] mV.O 

(p* ,q*)eR 

satisfies \x = v' . We must show that m = m'. 

Consider an arbitrary decorated point (p*,q*) e D m . Let R be a rectangle which contains 
(p*,q*) at its corner. Since 

v(R) = u'(R) = n(R) < oo, 

there are only finitely many other decorated points (r*, s*) 6 R with positive multiplicity in 
m or m'. By making R smaller, we can therefore assume that (p*, q*) is the only decorated 
point in R with positive multiplicity in either measure. Then 

m(p* ?g *) = U (R) = n(R) = U '(R) = my/). 
Since (p*, q*) was arbitrary it follows that m = m'. 



This completes the proof of Theorem |2.8| □ 
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2.5. Non-finite measures. If a measure is not everywhere finite, we restrict our attention 
to the parts of the plane where it is finite. Define the finite r-interior of an r-measure fi 
to be the set of decorated points 

^"(aO = {(P*> 9*) I there exists R e Rect(D) such that (p* , q*) e R and fi(R) < go} . 

The finite interior is 

$°(p) = {{p, q) I there exists R e Rect(D) such that (p, q) e R° and ft(R) < 00} . 

This is an open subset of the plane, being a union of open rectangles. It is easy to see that 
(j), q) E 3°{fi) if and only if (p*, q*) e ^'(fi) for all possible decorations. 



We can apply Theorem |2.8 to each rectangle R of finite measure to obtain a decorated 



diagram in R a . These diagrams will agree, by uniqueness, on the common intersection of 
any two such rectangles. Therefore we can combine these local definitions to get a well- 
defined multiset Dgm(/i) in the whole of ^"(/i). 

This multiset has the property that p(R) = card(Dgm(yu)|^) for any rectangle R e Rect(2)) 
with finite measure. There is one subtlety to address. 

Proposition 2.14. Let R e Rect(D). If R" c J'{p) then fi{R) < 00. 

Proof. We show that each (p, q) e R is contained in the relative interior of a rectangle S c R 
of finite measure. Then R, being compact, is the union of finitely many of these rectangles, 



and by subadditivity (Proposition 2.7) it must have finite measure. 



If (p, q) lies in the interior of R, then each of the four decorated points (p*,q*) belongs to 
$'(p) so we can find four finite-measure rectangles containing them. The union of these 
rectangles contains a neighbourhood of (p,q), and we can take S c R to be a rectangle 
contained in this union with (p, q) in its interior. It has finite measure, by subadditivity. 

If (p, q) lies on the interior of an edge, we take two finite-measure rectangles containing a 
relative neighbourhood of (p, q); and if (p, q) is a corner point we take just one rectangle. □ 

Corollary 2.15. Let fi be an r-measure on D c R 2 . Then there is a uniquely defined locally 
finite multiset Dgm(/z) in ^"(/i) such that 

fi(R) = card(Dgm(/i)| fi ) 

for every R e Rect(D) with R' c 3*(fi). □ 

Similarly, or as an easy consequence of Proposition |2. 14[ we have: 

Proposition 2.16. Let R e Rect(D). If R c J°(fi) then fi(R) < 00. □ 
Now we can define the diagrams of a general r-measure. 



The decorated diagram of an r-measure fi is the pair (Dgm(/i), ^"(/x)), where Dgm(/i) 
is the multiset in 3 m (p) described in the corollary. 
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The undecorated diagram is the pair (dgm(p), 3^° (//)), where 

dgm(/i) = {(p,q) | (p*,g*) £ Dgm(p)} n ?°(p) 

is the locally finite^] multiset in tF°(fi) obtained by forgetting the decorations in Dgm(p) 
and restricting to the finite interior. 



Remark. The point of Proposition 2.14 is that there is no 'hidden information' in \i beyond 
what is recorded in the diagram. The measure of any rectangle R with R m c jF"(p) is 
recovered by counting decorated points of Dgm(/i), and any other rectangle has infinite 
measure. 



We can bring these definitions into accord with the previous one (for finite r-measures) by 
agreeing that Dgm(p) and dgm(p) are abbreviations for (Dgm(/i),D B ) and (dgm(p), D°), 
when 2) is clear from the context. 

It is sometimes useful to adopt the following extension convention. An r-measure defined 
on a subset T) a R 2 can be interpreted as an r-measure on the whole plane R 2 , by agreeing 
that fi(R) = oo for any rectangle that meets R 2 — D. The extension has the same diagram 
as the original r-measure. 



2.6. The diagram at infinity. We now discuss r-measures in the extended plane R 2 . This 
requires infinite rectangles and decorated points at infinity. 

• For T) c R 2 ; the set Rect(D) consists of all rectangles R D of the form 

R = [a, b] x [c, d] 
where now — oo < a < b < +oo and — oo < c < d < +oo. 

• Decorated points in R 2 are pairs (p*,q*) where p* and q* are decorated real numbers 
or — oo + or +go~. The symbols — oo + , +go~ may be abbreviated to — oo, +oo. 

With these preliminaries, several other concepts are formally unchanged. The r-interior and 
the interior of T> are 

D" = {(p*, q*) | there exists R e Rect(D) such that (p*, q*) e R} , 

= {(P, q) I there exists R e Rect(D) such that (p, q) e R°} . 

One subtlety is that we take R° to be the relative interior of R as a subspace of R 2 . For 
instance, if R = [— oo, b] x [c, d] where b, c, d are finite, then R° = [— oo, b) x (c, d). 

The concept of an r-measure goes through as before: it is a function 

H : Rect(D) -»• {0, 1, 2, . . . , } u {oo} 

which is additive with respect to horizontal and vertical splittings. An r-measure has a finite 
r-interior and a finite interior: 

^"(/•O = {(£>*><?*) I there exists R e Rect(D) such that (p*, q*) e R and fx(R) < oo} , 
^(p) = {(p, q) | there exists R 6 Rect(D) such that (p, g) e i?° and fi(R) < oo} . 

3 As before, dgm is locally finite in 5F°, but may have accumulation points on the boundary of 5°. 
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We sometimes write S(/x) = R 2 — ^(/x), the singular support of /x. 
Claim. Corollary 2.15 is valid for r-measures on D c R 2 . 



Proof. The statements (and indeed the proofs) of Theorem 2.8 and Corollary 2.15 are in- 
variant under reparametrisations of the plane of the form 

x' = f(x), y' = g(y), 

where /, g are homeomorphic embeddings. We can view R 2 as a rectangle in R 2 via a 
transformation of this type; for instance 

x' = arctan(x), y' = arctan(y) 

identifies R 2 with the rectangle [— 7r/2,7r/2] x [— tt/2,tt/2] in R 2 . The theorem and its 
corollary now apply directly. □ 

The decorated diagram (Dgm(/x), ^"(/x)) and the undecorated diagram (dgm(/x), ^(/x)) are 
defined as before. 

Let V be a persistence module. The persistence measure /xy, previously defined by 

fJLv([a,b] x [c,d]) = (o a — o d | V> 

for a < b < c < d, extends easily to infinite rectangles. We allow for the possibility that 
a = —oo or d = +oo by setting 

V- x = o, V +aD = 0. 



In these cases the alternating sum formula of Proposition 2.3 becomes: 
fi v ([-cc,b] x [c, +oo]) = (m b — « C |V> =i c b 
/x v ( [a,b] x [c,+oo]) = <o a — m b — m c I V> =r^-r^ (if r^ < oo) 
/x v ([-oo,6] x [c,d] ) = < •&— « c — o d | V> =r^-r^ (if rf < oo) 

The first of these corresponds to the '/c-triangle lemma' of [TO] . 

So /xy can be thought of as an r-measure on the extended half-plane 

"K = {(p, q) | —oo < p < q < +ocj 
and its diagram Dgm(/xy) is defined on the subset of the r- interior 

ft' = {(p*, q*) I -°° + < p* < <f < +oo"} 

on which /xy is finite. 

Notice that there is a 1-1 correspondence between nonempty real intervals and elem ents 
of "K u . The interval corresponding to (p*,q*) is written {p*,q*) as before. Proposition 



2.1 



and Corollary 2.2 extend straightforwardly to this setting. In particular: 



Corollary 2.17. IfYis decomposable into interval modules then /xy(-R) counts the interval 
summands corresponding to decorated points which lie in R. □ 
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We finish this section by defining the 'measures at infinity' induced by an r-measure /i. The 
extended plane has 4 lines at infinity 

(-00, R), (+00, R), (R, -oo), (R, +oo ) 

and 4 points at infinity 

(-00,-00'), ( + 00, -OO), (-G0,+G0), ( + 00, +00 ) 

and, depending on where /x is finite, there are measures defined on each of these. On the 
lines at infinity these are 'interval measures' (the l-dimensional analogue of r- measures) , and 
at the points at infinity these are simply numbers. 

We write out the three cases of direct relevance to persistence modules. The other cases are 
similar. 

• the line (— oo, R): 

oo, [c, d]) = lim /i([— oo,6] x [c,d]) = min /i([— 00,6] x [c, d]) 

6— oo b 

for any interval [c, d] c R. 

• the line (R, +00): 

/i([a, b], +00) = lim /i([a, b] x [c, +00]) = min /i([a, b] x [c, +00]) 

c— >+ao c 

for any interval [a, b] c R. 

• the point (—00, +00): 

^(—00, +00) = lim /i([— 00, — e] x [e, +00]) = min /i([— 00, — e] x [e, +00]) 

e— >+co e 

Monotonicity of /1 guarantees that each limit exists, except when none of the rectangles in 
the expression belongs to Rect(D). When that happens we adopt the extension convention 
(see section 2.5), and regard these rectangles has having infinite measure and the limit as 



being 00. 

In order for the limit to be finite, we need at least one rectangle in the limiting expression 
to have finite measure. By considering the decorated diagram in that rectangle, it becomes 
clear what the 'measure at infinity' measures: it counts the decorated points of Dgm(/i) that 
lie in the segment, or corner, at infinity. 

For example, when //(— 00, [c,d]) is finite, it counts the decorated points of Dgm(/i) of the 
form (— oo + ,g*) where q* e [c,d]. When it is infinite, it means that some such (— oo + ,g*) 
does not belong to ^"(/f), either because it does not belong to D u or because all rectangles 
containing it have infinite mass. 



2.7. Diagrams of persistence modules. We now have two competing definitions of the 
decorated diagram of a persistence module. 

• If V = ® teL I(p|, <?*] is decomposable into intervals, then define 

Dgm(V) = lnt(V) = {(^,g;)|£eL}. 



This is a multiset in 'K m . 
• Let /iy be the persistence measure of V. Then 

Dgm(V) = DgmO v ). 
This is a multiset in 3 rB (/i), the finite r-interior of /i = /zy- 

The overloading of the term 'Dgm(V)' is partially excused by the following fact. 

Proposition 2.18. If V is decomposable into intervals, then Int(V) agrees with Dgm(^v) 
where the latter is defined, that is, on ^"(/x). 



Proof. By Corollary 2.17 we have 

card(lnt(V)|/j) = fi w (R) 
for all rectangles. On the other hand, we have 

card(Dgm(/x v )U) = H\(R) 

for all rectangles with fXy(R) < go. By uniqueness, it follows that Int(V) and Dgm( / u v ) must 
be the same multiset when restricted to J'fuv). □ 



Neither definition strictly outperforms the other, as the following examples show. 
Example 2.19. Let 

v = ©!&$,<£) 

ieL 

where the undecorated pairs (pe,qe) form a dense subset of the half-plane "K. Then Int(V) 
is defined; but /iv(-R) = 00 for every rectangle, so ^'(/ly) is the empty set and Dgm(yUv) is 
nowhere defined. 



Example 2.20. Adapting the example of Webb [19J to the real line: 

W t = for t > 

Wq = {sequences (xi,x 2 ,x 3 , . . . ) of real numbers} 

W t = {sequences with x n = for all n < \t\] for t < 

As before, this is not decomposable into intervals. On the other hand, it is easy to see that 

<o a — H I W> = conullity(iy a - W b ) < oo 

except when a = -co, and 

{• c — o d I W> = nullity (W c -> W d ) < oo 
except when c < < d. Each of these terms dominates 

(°a— •b— *c— °d | W>, 

which is therefore finite for all rectangles which do not contain (— oo,0 + ). It is not difficult 
to complete the argument that 

=^£--{(-oo,0 + )} 
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Figure 4. The persistence diagram Dgm(W) = Dgm(/xw) f° r the example of 
Webb. This is defined everywhere in the extended half-plane except (—00, + ). 



and to determine the persistence diagram 

DgmGuw) = {(-n + , 0+) | n = 1, 2, 3, . . . } 

over ^'(yu). See Figure |lj In other words, although this example is not decomposable into 
intervals, its persistence measure is well-behaved away from the singular support, which is 
the point (—00, + ). 



2.8. Tameness conditions. We now describe several different levels of 'tameness' for a 
persistence module, starting with the tamest. 

A persistence module V is of finite type, if it is a finite direct sum of interval modules. 

It is locally finite, if it is a direct sum of interval modules such that every t e R has 
a neighbourhood which meets only finitely many of the intervals. Since closed bounded 
intervals are compact, this is equivalent to the stronger assertion that every bounded subset 
of R meets only finitely many of the intervals. 

Proposition 2.21. A persistence module V is locally finite if and only if (i) each V t is finite- 
dimensional and (ii) there is a locally finite set S c R such that v% is an isomorphism for 
every pair b < c with [b, c] n S = 0. 



In less formal language, condition (ii) asserts that V is constant over each interval of the 
open set R S. 



Proof. Suppose conditions (i) and (ii) hold. Select a locally finite countable set 

T : • • • < t_ n < ■ ■ ■ < t-x < t < ti < ■ ■ ■ < t n < . . . 

unbounded in both directions, where the even- numbered points include the entire set S. By 
the result of Webb used in Theorem 1.4 , we can decompose Vt into intervals. Since V is 
constant over each (t2n,^2n+2), this extends uniquely to a decomposition of V into interval 
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Figure 5. A quadrant, horizontal strip, vertical strip, and finite rectangle in 3i. 



modules. Any bounded subset of R meets only finitely many of the t<m and [tin-, hn+2)- 
Applying condition (i) we find that it meets only finitely many of the intervals. 

The converse direction is immediate: take S to be the set of endpoints of the intervals in the 
decomposition of V. □ 



For a finite or locally finite persistent module, it is easy to see that ^"(/u) = 0~C, so both 
persistence diagrams are defined on all of "K, including on the diagonal and at infinity. 
Diagonal points must be decorated (p~,p + ), since only these belong to rectangles in "K. 

We introduce four more kinds of tameness. The assumptions here are about finiteness of /xy 
over different types of rectangle (see Figure [5]). Each condition guarantees the existence of 
the persistence diagram over a certain subset of the extended half-plane. The finite part of 
the plane (except the diagonal) is always included; it is at infinity that the four conditions 
differ. 

(i) We say that V is q-tame, if < 00 f° r every quadrant Q not touching the diagonal. 

In other words 

<•&— « c | V> < oo 

(that is, rg < oo) for all b < c. The persistence diagram Dgm(/iv) is defined over the set: 

{(p*, q*) | -oo •;: j> < q • +00} = 



(ii) We say that V is h-tame, if [iy(H) < 00 for every horizontally infinite strip H not 
touching the diagonal. In other words, 

(•b— « c — °d I V> < 00 

for all b < c < d. The persistence diagram Dgm(/iy) is defined over the set: 

{(P* 3 <?*) I < q < +00} = 
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Figure 6. Diagrams of persistence modules which are: (left) h-tame and 
v-tame but not q-tame; (right) r-tame but not h-tame or v-tame. 



(iii) We say that V is v-tame, if fiy(V) < go for every vertically infinite strip V not touching 
the diagonal. In other words, 

(°a— *b— «c | V> < 00 

for all a < b < c. The persistence diagram Dgm(/iv) is defined over the set: 

{(p*, q*) | —oo < p < q < +00} = 

(iv) We say that V is r-tame, if fJ,y(R) < 00 for every finite rectangle R not touching the 
diagonal. In other words, 

(°a— *b— *c— °d I V> < 00 

for all a < b < c < d. The persistence diagram Dgm(/iv) is defined over the set: 

{(p*, q*) I — oC' < p < q < +00} = 

Here is the diagram of inclusions between the different classes of tame persistence modules: 

v-tame 




finite — > locally finite ->■ q-tame r-tame 




h-tame 

One can show that all the inclusions are strict, and also 

q-tame c h-tame n v-tame, h-tame u v-tame £ r-tame. 
Examples for these last two assertions are suggested by the diagrams in Figure [6j 

We have not determined whether 

h-tame © v-tame = r-tame. 

If an r-tame module is decomposable then it can certainly be written as the direct sum of 
an h-tame module and a v-tame module, by partitioning the intervals appropriately. The 
general situation seems more subtle. 
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Later we will show that 'q-tame' can be thought of as the completion of 'locally finite': a 
persistence module is q-tame if and only if it can be approximated arbitrarily well by locally 
finite modules. See Theorem 14.191 

Many naturally occurring persistence modules are q-tame. Here is a typical example. 

Theorem 2.22. Let X be a finite polyhedron]^ and let f : X — > R be a continuous function. 
Then persistent homology H(X su b) of the sublevelset filtration of (X, /) is q-tame. 



Proof. For any b < c we must show that 

R(X b ) — H(X C ) 

has finite rank. Begin with any triangulation of X, and subdivide it repeatedly until no 
simplex meets both f~ l (b) and / _1 (c). If we define Y to be the union of the closed simplices 
which meet X b , then we have 

X b c y c X c 

and hence the factorisation 

E(X b ) — H(Y) — H(X C ). 

Since Y is a finite polyhedron, H(Y) is finite dimensional, so H(X b ) —*■ H(X C ) has finite 
rank. □ 

Remark. Individual terms vector spaces H(X 6 ) in the persistence module may be infinite- 
dimensional. For instance let X be a closed disk in the plane, and let / be a non-negative 
function whose zero set is a Hawaiian earring; then H(Jf°) is uncountable-dimensional. How- 
ever, the theorem shows that such pathological behaviour is ephemeral; it does not persist 
over any nontrivial interval of parameter values. 



Other examples are developed in [8]. It is shown, for instance, that the Vietoris-Rips and 
Cech complexes of a pre-compact metric space have q-tame persistent homology. 

With a little more work we can show: 

Theorem 2.23. Let X be a locally compact polyhedron, and let f : X —*■ R be a proper 
continuous function. Then persistent homology H(X su b) of the sublevelset filtration of (X, /) 
is h-tame and v-tame (and r-tame). 



Proof, h-tameness. Let b < c < d. We must show that 
(h-*) <•&— •c— o d | H(X sub )> < oo. 

Begin with a triangulation of X. Only finitely many simplices meet the compact set 
so again after a finite number of subdivisions no simplex meets both f~ l (b) and / _1 (c). 

Now let Y be the union of the closed simplices which meet X b , and let Z be the union of 
the closed simplices which meet X d . This gives a diagram of inclusions 

X b c Y c X c c X d c Z. 



By 'polyhedron' we mean the realisation of a simplicial complex as a topological space. 
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Note that the polyhedron Z differs from its subpolyhedron Y by the addition of only finitely 
many simplices, since each such simplex must meet the compact set f _1 [b, d\. Thus the 
relative homology H(Z,Y) is finite-dimensional. 

We now work with the induced homology diagram 

H(X 6 ) H(Y) -> H(X C ) -> H(X rf ) -> H(Z). 

In the obvious notation, 

<•&- -•c— °d > = ^ *b *y *c °(i 

< < »!/- -°^> 

= dim[ker(H(F) -> H(Z))]. 

By the homology long exact sequence for the pair (Z, Y), we have 

ker(H(F) -> H(Z)) = im(H(Z, F ) -> H(Y)) 

which is finite-dimensional. This confirms (h-*). 

v-tameness. Let a < b < c. We must show that 

(v-*) <o a — » b — « c | H(X sub )> < oo 

Using a similar argument to the above, we construct a diagram of inclusions 

Y c X a c X b c Z c X c 

where Y, Z are polyhedra with H(Z, Y) finite-dimensional. Working with the homology 
diagram 

H(Y) -> H(X a ) -> H(X 6 ) -> H(Z) -> H(X C ), 

we estimate 

< °a— -*c> = <°j/— °a— »6— «c> 

< <°y- > 

= dim[coker(H(F) -> H(Z))] 

By the homology long exact sequence of the pair (Z, Y), we have 

coker(H(F) -> H(Z)) im(H(Z) -» H(Z,Y)) 

which is finite-dimensional. This confirms (v-*). □ 

Remark. The theorem essentially asserts that the only possible bad behaviour of H(X sub ) 
occurs at (—00, +00). It is easy to construct examples which are definitely not q-tame.The 
simplest example is X = Z (the integers), with f(n) = n. The O-homology of any sublevelset 
is infinite dimensional, and all inclusions have infinite rank. 
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Figure 7. A persistence module V, discretised at a, b, c. The persistence 
diagram Dgm(V a> fe jC ) is localised at six grid vertices. The multiplicity of each 
vertex is equal to the number of decorated points of Dgm(V) in the rectangle 
immediately below and to the left of it. Decorated points of Dgm(V) in the 
remaining triangular regions do not show up in Dgm(V a) b )C ). 

2.9. Finite approximations. Away from the finite r-interior, the r-measure gives a decid- 
edly limited view of the structure of a persistence module. For example: 

• It is not possible to distinguish between the many nonisomorphic persistence modules 
V for which /iy is infinite on every rectangle. 

• If the persistence diagram of V contains a sequence of points (p*, q*) with p n , q n con- 
verging to r from below and above, respectively, then there is no way to determine the 
multiplicity of (r~,r + ) from the measure alone. 

On the other hand, from /jy we do recover all information obtainable by restricting V to 
finite subsets T a R. We may call this the 'finitely observable' part of V. Specifically, for 
any finite index set 

T : ai < a 2 < ■ ■ ■ < a n 

we have 

(t) <[<*!>%] I V T> = < Q 0i _i— •ai — •aj— °a j+1 \ V> = ^ V ([Oi-l, <k] x [%> %+l]) 

(where a Q ,a n+ i are interpreted as — go, +oo respectively). 

This can be interpreted as a sort of 'snapping principle' (see [HIE] for the origin of this term). 
The left-hand side of (jf| gives the multiplicity of the interval [ai,aj +1 ) in the traditional 
definition of the barcode for Vt- For us, this is [a~, aj +1 ). The right-hand side of (FR) counts 
the decorated points of Dgm(/iy) in the rectangle [aj_i,aj] x [aj,aj + i]. See Figure [7j 

There are some well known situations in which the entire structure of V is finitely observable. 
For example, suppose 

• X is a compact manifold and / is a Morse function; or 
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• X is a compact polyhedron and / is piecewise linear. 

In each case there is a finite set of critical points 

T : a\ < a 2 < • • • < a n 

such that the inclusion of sublevelsets X s a X 1 is a homotopy equivalence provided that (s, t] 
does not meet any of the critical points. It follows that the sublevelset persistent homology 
V = H(X su b) is constant on each of the intervals 

(-00, ai), [ai,a 2 ), [a 2 ,a 3 ), [a n _i,a n ), [a n ,+oo). 

The structure of V is therefore determined by V-r, it is of finite type, and Dgm(V) is localised 
at the points 

(a[, aj) for 1 < i < j < n, 

and 

(a~, +00) for 1 < i < n. 
3. Interleaving 

As with any category, two persistence modules U,V are said to be isomorphic if there are 
maps 

$ e Hom(U,V), 
* e Hom(V, U), 

such that 

= lu, 
$^ = l v . 

This relation is too strong for situations where the data leading to the construction of a 
persistence module is obtained with some uncertainty or noise. The natural response is to 
consider a weaker relation, 5-interleaving, where 5^0 quantifies the uncertainty. 

In this section, we define the interleaving relation and study its elementary properties. We 
prove the nontrivial result (from [6]) that if two persistence modules are 5- interleaved, then 
they are connected in the space of persistence modules by a path of length 5. This 'interpo- 
lation lemma' is a crucial step in the proof of the stability theorem in section |4| 

In this section, all persistence modules are indexed by R unless explicitly stated otherwise. 

3.1. Shifted homomorphisms. The first step is to consider homomorphisms which shift 
the value of the persistent index. Let U, V be persistence modules over R, and let S be any 
real number. A homomorphism of degree 5 is a collection $ of linear maps 

(pf.U t ^ V t+5 
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for all t e R, such that the diagram 

U s '-^ U t 



V s +s — ^ V t+S 

commutes whenever s ^ t. 
We write 

Hom <5 (U, V) = {homomorphisms U — > V of degree 5}, 
End <5 (V) = {homomorphisms V — » V of degree 5}. 

Composition gives a map 

Hom 42 (V,W) x Hom 5l (U,V) -> Hom^+^U, W). 

For 5^0, the most important degree-^ endomorphism is the shift map 

4 £ End 5 (V), 

which is the collection of maps (fJ +<5 ) from the persistence structure on V. If $ is a homo- 
morphism U — > V of any degree, then by definition 



(Jji <5 _ i <5 <T) 



for all 5^0. 



3.2. Interleaving. Let 5^0. Two persistence modules U, V are said to be ^-interleaved 

if there are maps 

$ e Hom 5 (U,V) 
e Hom 5 (V,U) 



such that 



m = i 
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More expansively (with many more indices), this means that there are maps 

<f> t :U t -> V t+ 5 and : v t -> U t+ s 
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defined for all t, such that the following diagrams 



U, 



„t+s 



s — o 



s-5 



s+5 



(3.1) 



V 3+s 
V„ - 



-* V t+5 



„*+« 



v; 



s—o 



■0t 



8-5 



s+<5 



U s+ s — - — > U t+ s 



commute for all eligible parameter values; that is, for all s < t. Where possible, we will be 
concise rather than expansive. 

Here is the classic example of a pair of interleaved persistence modules. 

Example 3.2. Let X be a topological space and let /, g : X — * R. Suppose ||/ — g\ x < 5. 
Then the persistence modules H(X{ ub ), H(Xg Ub ) are 5-interleaved. Indeed, there are inclusions 

(Xjy^(X,g) t+5 

(x, g y^(x,f) t+s 



for all t, which induce maps 



* : H(X»J 



H(Xf ub ) 
H(Xf ub ) 



of degree 5. Since all the maps are induced functorially from inclusion maps, the interleaving 
relations are automatically satisfied. 



This is the situation for which the stability theorem of Cohen-Steiner, Edelsbrunner and 
Harer [in] was originally stated: if two functions /, g are close then the diagrams for their 
sublevelset persistent homology are close. Subsequently, stability was formulated as a theo- 
rem about the diagrams of interleaved persistence modules P [7j. In the present paper, we 
come to view stability as a theorem about r-measures. 



3.3. Interleaving (continued). An interleaving between two persistence modules can itself 
be thought of as a persistence module over a certain partially ordered set (poset). We develop 
this idea now. 

Consider the standard partial order on the plane: 



(Phil) < (P2, <?2) <=> Pi < T>2 and q 1 «S q 2 . 
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For any real number x, define the shifted diagonal 

K = {(p,q) \q-p = 2x}^R 2 . 

As a poset, this is isomorphic to the real line. Specifically, we identify ieR with the point 
(t — x, t + x) £ A x . Through this, we get a canonical identification between persistence 
modules over R and persistence modules over A x . 

Proposition 3.3. Letx,y be real numbers. Persistence modules U, V are \y — x\-interleaved 
if and only if there is a persistence module W over A x u A y such that W\a x = U and 
W| Ay =V. 

Proof. We may assume x < y. 

We claim that (i) the extra information carried by (y-x)-inter leaving maps \I/ is equivalent 
to (ii) the extra information carried by W. Let us describe both, more carefully: 

(i) In addition to U, V we have a system of maps $ = ((f> t ), where 

4> t :U t -> V t+ y- x , 

and a system of maps \l/ = (ipt), where 

V>t : V t -> Ut+y-x- 

These are constrained by the relations (for all rj 5= 0). 

(t) *is = i$*, *q = il% *$ = il y - 2x , ^ = il y - 2x . 

There are no other constraints. 

(ii) In addition to U, V the persistence module W carries maps between the two components 
A x , A y . These maps are constrained by the composition law 

T T S 

W R = Wg O Wft 

for all R, S, T e A x u A y with R^S < T. 

First, observe that we recover the maps <f>t, ipt as vertical maps from A x to A y , and horizontal 
maps from A y to A x , respectively (see Figure [8]): 



Ut — Wt- x ,t+x ~ * Wt-x,t+2y-x — Vt+y-x 

V t = W t -y : t + y — " W t + y-2x,t+y — Ut + y-x 

Next, observe that the composition law implies all of the relations ([f]). 

Finally, there is no additional information in W, beyond the interleaving maps and relations. 
Indeed, all remaining maps Wg, where S < T, can all be factored in the form: 

w s = v l+ y -x 0( Ps if S e A x and T e A y , 
w 5 = u l+ y -x ° if S e A y and T e A x . 
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Figure 8. The maps $, recovered from the module W over A x u A 



Thus each map in W is an instance of one of 




from A x to 
from A y to A y 
from A x . to A y 
from A y to A x 



It is a simple matter to verify that the composition law is satisfied for each composable pair 
of maps. For instance 



This can be done using only the known relations, so there are no further constraints on 



Remark. This characterisation makes it clear (or, depends on the fact) that all composable 
combinations of the maps u, v, 0, ip from a given domain to a given codomain must be equal: 
indeed, they must agree with the appropriate map Wg of W. 

3.4. The interpolation lemma. In this section we give a proof of the following theorem, 
which first appeared in [6]. The result strikes us as somewhat surprising. 

Lemma 3.4 (Interpolation lemma). Suppose V, V are a 5 -interleaved pair of persistence 
modules. Then there exists a 1 -parameter family of persistence modules (V x \ x g [0,5]) such 
that Uo, U<5 are equal to U, V respectively, and U x , V y are \y — x\-interleaved for all x,y s [0, 5] . 

We can make a sharper statement about what is proved. Given a specific pair of interleaving 
maps 



2x _ -i rj+(;+2y-2x 




□ 



$ g Hom 5 (U,V) 
e Hom 5 (V,U) 
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the construction explicitly provides, for each x < y, a pair of interleaving maps 

$^Hom ri (U„U 9 ) 

vi^eiio,,," '(:;„.:;,.) 

such that $o; ^5 are equal to $, \1/ respectively, and moreover 

x y x 



z y z 



for all x < y < z. 



In view of Proposition 3.3, this sharp form of the interpolation lemma can be restated as 



follows. (We have also replaced the interval [0,5] with the more general [xo,^i].) 

Theorem 3.5. Any persistence module W over A u As extends to a persistence module 
over the diagonal strip 

A m = {(p,q)\0^q-p^25}^R 2 . 
Remark. The extension is by no means unique. 



Let us clarify how Theorem |3.5| implies Lemma |3.4| If U, V are ^-interleaved, then there 
exists a persi stence module W over A u As such that W|a = U and W|a 5 = V. By 
Theorem 3.5| this extends to W over the strip Ap^. If we define a 1-parameter family 
U x = WIa^, then U x ., are \x — y\ -interleaved for all x, y e [0, 5]. 



The equivalence between Theorem 3.5 and the sharp form of Lemma 3.4 is left to the reader. 



Proof of Theorem 3.5[ In order to express the proof more symmetrically, it is convenient 



to replace the interval [0,5] by the interval [—1,1]. This can be done by a rescaling and 
translation of the plane. Accordingly, suppose we are given a persistence module W over 
A_i u Ai. 

Our strategy is to construct two persistence modules over the strip A[_ 11 ] and a module 
map between them. The image of this map is also a persistence module over the strip, and 
will be the required extension. 



By Proposition 3.3, W provides U = W|a_i and V = W|ai, which we can view as persistence 
modules over R, 



U t = W(t+i, t -i), 
V t = W( t -i,t+i), 



as well as interleaving maps $ e Hom 2 (U, V) and ^ e Hom 2 (V, U) of degree 2. 
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From U, V we construct four persistence modules over R 2 : 



A = U[p - 1] 


defined by 


A(p,q) 


= U P - 1 


and 


(r,s) 


— "p-1 


B = V[q- 1] 


defined by 


B (p,q) 


= V1 


and 


Ar,s) 
%.«) 




C = U[q + 1] 


defined by 




= u q+1 


and 


C (P,<7) 


= < + + \ 


D = V[p + 1] 


defined by 


D (p,q) 


= Vi 


and 


j(*V») 


= «S1 



Next, we construct four module maps. 



lu : A 


— > C defined at (p, q) to be 


q+l 
U p-1 




-u q+1 


$ : A 


— > D defined at (p, q) to be 


0P-1 


C/ p -i- 


■* Vp+l 


* : B 


— > C defined at (p, q) to be 








ly : IB 


— > O defined at (p, q) to be 


p+1 
<-l 


^1 - 





The maps <3>, \I/ are defined over the whole plane, whereas the map lu is defined only where 
p — 1 < q + 1, and the map ly is defined only where q — 1 < p + 1. It follows that all four 
maps are defined in the region where 

-2 < q -p < 2, 

which is precisely the strip A[_ 11 ]. Henceforth, we restrict to that strip. 

It is easy to confirm that these are module maps. The required commutation relations involve 
composable combinations of the maps u, v, <p,ip, which always agree by the remark following 
Proposition |3.3| 



Define SI e Horn (A © B, C © D) by the 2-by-2 matrix 

lu * 
$ l v 

of module maps. Our claim is that W = im(fi) is the required extension. An equivalent 
statement is that W = coim(f2) is the required extension (using the coimage rather than the 
image). 

Step 1: W|a_i is isomorphic to U. 



Proof. On A_i = {(t + 1, t - 1)}, we have 

(A © M) t = Ut@V t -2, 
(C©D) t = Ut®Vt+2, 

and the homomorphism S7| a_i is given by 

,t 
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The key point is that this factorises (in a block- matrix sense it 'has rank 1'). The factorisation 
can be written in matrix form 



u t = 

or as a diagram of R-module maps 



U0V[t-2] U U0V[t + 2] 



where 



Qi(u© v) = u + \P(v) and 2 (u) = u © $(u). 



Since Qi is surjective and fi 2 is injective (at all indices t), it follows that 

coim(fi) = U = im(fi) 
where the isomorphisms are given by Qi,Q 2 respectively. 

Step 2. W|ai is isomorphic to V. 

Proof. On Ai = {(t - l,t + 1)}, we have 

(A0B) ( = U t - 2 ®V t , 
(C0D) t = C/ t+2 ©^, 

and 



Again Q factorises. The factorisation can be written in matrix form 



[ <k-2 V\ ] 



or as a diagram of R-module maps 



U[t - 2] ©V V U[t + 2] © V 

where 

fi 3 (u© v) = $(u) + v and fi 4 (v) = *(v) © v. 
Since f2 3 is surjective and ^4 is injective, we have 

coim(fi) ^ V ^ im(fi) 
where the isomorphisms are given by Q3, respectively. 



□ 



□ 



Step 3. Under the isomorphisms above, the cross maps of W are precisely <3> and 
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Proof. The cross maps for W are induced by the cross maps for A © B and also by the cross 
maps for C©B. (The result must be the same, because Q is a module homomorphism.) 



The following diagram shows the vertical maps for A © B (on the left) and C 
right), as well as the factorisations of Steps 1 and 2. 



(on the 



uev[t-2] 



iue4 



U©V[t + 2] 



n 3 



Y[t + 2] 



n 4 



U©V[t + 2] 



l, 4 ,©lv 



U[t + 4]0V[t + 2] 



The diagram, with $ in the middle, is easy verified to commute using the explicit forms of 
the maps It follows that $ is the induced vertical cross map for W. 



A similar argument using the diagram 

u[t-2]ev — 



n 4 



U[t + 2]©V 



V[t + 2] © V — ► U[t + 2] U[t + 2] © V[t + 4] 



shows that \I/ is the induced horizontal cross map. 



□ 



This completes the proof of Theorem 3.5 



□ 



We note that Step 3 isn't necessary to deduce the interpolation lemma (3.4), because it is 



sufficient to demonstrate that U, V belong to some 1-parameter family of persistence modules. 
It is important that the \x — y\ -interleaving between each pair U x ,U y exists, but it doesn't 
matter what it is. 



One can interpret the stronger Theorem 3.5 as a functor extension theorem. If we regard the 



posets A XQ u A Xl and A^^] as categories, then persistence modules over these posets are 
the same as f unctors to the category of vector spaces. (See the remark near the beginning 
of section 1.1 ) The theorem asserts the existence of an extension W 



k [a;o,a;i 



A XQ U A Xl 



-> Vect 



for any functor 
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4. The Isometry Theorem 



In this section we discuss the metric relationship between persistence modules and their 
persistence diagrams. As in section [3j all persistence modules are indexed by R unless 
explicitly stated otherwise. 

The principal result is the famous stability theorem of Cohen-Steiner, Edelsbrunner and 
Harer [TU] , in the generality established by [7J . The main difference here is that we emphasise 
persistence measures in the proof, and indeed we derive the standard theorem from a more 
general statement about persistence measures. The basic structure of the proof is essentially 
the same as in [10J. 

The secondary result is the converse inequality, which together with the stability theorem im- 
plies that the space of q-tame persistence modules is isometric with the space of locally finite 
persistence diagrams. This isometry theorem appeared originally in the work of Lesnick [16] 
for modules which satisfy dim(V^) < go for all t, and independently in the work of Bubenik 
and Scott [3] for modules of finite type. 



4.1. The interleaving distance. In this section we define the interleaving distance between 
persistence modules. This was introduced in [7J. 

The first observation is that if U and V are <5-interleaved, then they are (5 + e)-interleaved 
for every e > 0. Indeed, the maps 



provide the required interleaving. 



The challenge, then, if two persistence modules are interleaved, is to make the interleaving 
parameter as small as possible. The minimum is not necessarily attained, so we introduce 
some additional terminology. 

We say that two persistence modules U, V are 5 + -interleaved if they are (5 + e)-interleaved 
for all e > 0. As we shall see, this does not imply that U, V are 5-interleaved. 

Example 4.1. Two persistence modules are 0- interleaved if and only if they are isomorphic. 

Example 4.2. A persistence module V is ephemeral if t>* = for all s < t. The spaces V t 
can be completely arbitrary. Let U and V be two non-isomorphic ephemeral modules. Then 
U, V are + -interleaved but not 0-interleaved. Indeed, 1^ = and ly = for all e > 0, so 
the zero maps 

$ = e Hom e (U,V) 
f = 0e Hom e (V,U) 



give an e-interleaving. 
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The interleaving distance between two persistence modules is denned as follows: 

di(U, V) = inf{5 | U, V are 5-interleaved} 

= min{5 | U, V are <5 + -interleaved} 
If there is no ^-interleaving between U, V for any value of 5, then di(U, V) = go. 
Proposition 4.3. The interleaving distance satisfies the triangle inequality: 

di(U, W) < di(U, V) + di(V, W) 
for any three persistence modules U, V, W. 

Proof. Given a (^-interleaving between U, V and a (^-interleaving between V, W one can 
construct a 5 = (8\ + (^-interleaving between U, W by composing the interleaving maps: 

One easily verifies that $ = $2$i and ^ = ^1^2 are interleaving maps. Explicitly: 
m = ^ 1 ^ 2 $ 2 $ 1 = V^&i = ^i$ilu 2 = lu 5ll u 2 = lu 

m = $2$^!^ = $ 2 lf* 2 = $ 2 * 2 1^ = = 1 2 4 

Now take the infimum over 5%, 5 2 . □ 

The proposition tells us that dj is a pseudometric. It is not a true metric because di(U, V) = 
does not imply U = V, as we saw above. In fact, two q-tame persistence modules have 
interleaving distance if and only if their undecorated persistence diagrams are the same. 
This is a consequence of the isometry theorem. 



Here is the simplest instance. The direct proof is left to the reader (or see Proposition 4.6). 
Example 4.4. The four interval modules 

I[p,q], I\p,q), I(p,q], I(p,q) 
are + -interleaved but not isomorphic. 

The following property of interleaving distance will be useful later. 

Proposition 4.5. Let Ui,U 2 , Vi, V 2 be persistence modules. Then 

diOUa U 2 , Vi V 2 ) < max (di(Ui, Vi), di(U 2 , V 2 )) 

More generally, let (JJe \ £ e L) and (Ye \ £ e L) be families of persistence modules indexed by 
the same set L, and let 

u = 0u,, v = 0v £ . 

IeL ieL 

Then 

di(U, V) < sup (di(U £ , Y e ) \ £ e L) . 
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Proof. Given <5-interleavings ^e,^e for each pair Ve,Ve, the direct sum maps $ = ©3^, 
= (J) ^>£ constitute a 5-interleaving of U, V. Thus any upper bound on the di(U^, V^) is an 
upper bound for di(U, V). In particular, this is true for the least upper bound, or sup. □ 

4.2. The bottleneck distance. Now we define the metric on the other side of the isometry 
theorem, namely the bottleneck distance between undecorated persistence diagrams. For a q- 
tame persistence module V, every rectangle not touching the diagonal has finite //y-rcieasure. 
This implies that the undecorated diagram 

dgm(V) = dgm(/i V ) 

is a multiset in the extended open half-plane 

< K° = {{p, q) | —oo < p < q < +00}. 

In order to define the bottleneck distance, we need to specify the distance between any pair 
of points in "K° , as well as the distance between any point and the diagonal (the boundary 
of the half-plane). These distance functions are not arbitrary; they are defined as they are 
because of the interleaving properties of interval modules. 

(point to point): The first idea is that two undecorated diagrams are close if there is a 
bijection between them which doesn't move any point too far. We use the £°°-metric in the 
plane: 

d°°((p, q), (r, s)) = max (\p - r\,\q - s\) 
Points at infinity are compared in the expected way: 

(T((-oo,g),(-oo,s)) = \q-s\, 
d°°((p,+oo),(r,+oo)) = \p-r\, 

and 

cT((-oo, +00), (-00, +00)) = 0. 
Distances between points in different strata (e.g. between (p,q) and (— 00, s)) are infinite. 

The next result gives a relationship between the ^°°-metric and the interleaving of interval 
modules. 

Proposition 4.6. Let [p*,q*] and [r*,s*} be intervals (possibly infinite), and let 

U = l[p*,q*) and V = I(r*,s*] 
be the corresponding interval modules. Then 

diCu.vxd 00 ^,?),^^)). 

The proof is postponed to the end of the section. We remark that equality holds provided 
that the intervals overlap sufficiently (the closure of each interval must meet the midpoint 
of the other), so the proposition is tight in that sense. 

(point to diagonal): The next idea is that points which are close to the diagonal are 
allowed to be swallowed up by the diagonal. Again we use the £°°-metric: 

d co ((p,q),A) = l(q-p) 
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Again this is related to the behaviour of interval modules. 
Proposition 4.7. Let {p*,q*} be an interval, let 

U = l{p*,q% 

be the corresponding interval module, and let denote the zero persistence module. Then 

di(u,o) = !(<?-?). 

(This is infinite if the interval is infinite.) 

Proof. Let 5^0. When is there a 5- interleaving? The interleaving maps must be zero 
(no other maps exist to or from the module 0), so the only condition that needs checking 
is = l^f, which is really = 1§ . This holds when 5 > \{q — p) and fails when 
5<\{q-p). ' □ 

We now use these two concepts to define the bottleneck distance between two multisets A, B 
in the extended half-plane. 

It is easier to work with sets rather than multisets. One way to do this is to attach labels 
to distinguish multiple instances of each repeated point. For instance, a with multiplicity k 
becomes «i, . . . Henceforth, we will do this without much comment. 

A partial matching between A and B is a collection of pairs 

McAxB 

such that: 

• for every a e A there is at most one (3 e B such that (a, 0) e M; 

• for every f3 e B there is at most one a e A such that (a, /3) e M. 

We say that a partial matching M is a (^-matching if all of the following are true: 

• if (a,/3) 6 M then d°°(a,/3) ^ 5; 

• if a 6 A is unmatched then d°°(a, A) ^ 5; 

• if p 6 B is unmatched then d c0 (/3, A) ^ 5. 

The bottleneck distance between two multisets A, B in the extended half-plane is 

db(A, B) = inf (5 \ there exists a 5-matching between A and B) . 

In section 4.3 , we will show that 'inf can be replaced by 'min' if A, B are locally finite. 

Remark. In order for db(A, B) < go, it is necessary that the cardinalities of A, B agree over 
each of the three strata at infinity: 

card(A| { _ oo}xR ) = card(B| { _ oo}xR ) 

card(A| Rx{+00 }) = card(B| Rx{+00 }) 

card(A| { _ oo}x{+oo} ) = card(B| { _ oo}x{+oo} ) 
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gives or 




Figure 9. The partial matchings between A, B and B,C (left) compose to 
give a partial matching between A, C in two different ways (middle, right). 
The non-uniqueness arises from the point of multiplicity 2 in B. 

Indeed, these points have infinite distance from the diagonal and from points in the other 
strata, and therefore they must be bijectively matched within each stratum. 

Proposition 4.8. The bottleneck distance satisfies the triangle inequality: 

d b (A,C)<d b (A,B) + d b (B,C) 

for any three multisets A, B, C. 

Proof. Suppose Mi is a <5i-matching between A, B, and M 2 is a 5 2 -matching between B, C. 
Let 5 = 5± + 82- We must show that there is a 5-matching between A, C. 

Define the composition of Mi, M 2 to be 

M = ((a, 7) I there exists e B such that (a, (3) e M x and (/3, 7) e M 2 ) . 
We verify that M is the required 5-matching: 

• If (a, 7) e M then 

d°>, 7) < d°>, p) + d°°(/3, 7) < 5 1 + 5 2 = 5 
where /3 e B is the point linking a to 7. 

• If a is unmatched in M then there are two possibilities. Either a is unmatched in Mi, 
in which case 

Or a is matched in Mi, let's say (ot,/3) 6 Mi. Then /3 must be unmatched in M 2 , so 

d°°(a, A) < d°°(a, p) + d°°(/3, A) < 5 1 + 5 2 = 5. 

• If 7 is unmatched in M, then a similar argument shows that 

d°°( 7 ,A)^5. 

This completes the proof. □ 

Remark. Because A, B, C are in truth multisets rather than sets, the composition operation 
between matchings is not uniquely defined, but depends on how the matchings are realised 
when labels are added. Figure [9] illustrates what can happen when B has points of multiplicity 
greater than 1. Since we are concerned only with the existence of the composite matching, 
this ambiguity does not trouble us. 
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Here is the first substantial-looking result comparing the interleaving and bottleneck dis- 
tances. 

Theorem 4.9. Let U, V be decomposable persistence modules (i.e. direct sums of interval 
modules). Then 

d i (U,V)<d b (dgm(U),dgm(V)). 
(We remind the reader that tameness is not required to define Dgm and dgm in this case: 



see section 1.6 



Proof. We show that whenever there exists a 5- matching between dgm(U) and dgm(V), we 
have d;(U, V) ^ 5. The result follows by taking the infimum over all such 5. 

Let M be a ^-matching between the two diagrams. Since the points in each diagram corre- 
spond to the interval summands of the module, we can construct from M a partial matching 
between the interval summands of U and V. 

Re-write U and V in the form 

u = 0u £ , v = 0v, 

leL leL 

so that each pair (IIJ^, Vg) is one of the following: 

• a pair of matched intervals; 

• is an unmatched interval, = 0; 

• is an unmatched interval, = 0. 



In each case, by Propositions 4.6 and 4.7, we have di(U£, Yf) ^ 5. It follows from Proposi- 
tion IO that di(U, V) < S. □ 



We complete this section with the postponed proof. 

Proof of Proposition ^. 6] We treat the case where p, q, r, s are all finite. We must show that 
if 

5 > max (\p — r\,\q — s\) 
then U, V are 5-interleaved. We define systems of linear maps 

$ = : U t - V t+S ) 
= : V t - U t+S ) 

and then show that the interleaving relations 

$1^ = 1^$, *1^ = 1^, = ^ = l 2 Y S 

hold. 
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The definition of the maps (f>t,i>t is straightforward. Each vector space in U, V is equal to 
zero or to the field k. If the domain and codomain equal k, then the map is defined to be 
the identity 1 = lk- Otherwise, the map is necessarily 0. 

The first step is to show that the systems of maps $ = (0 t ), \& = (ip t ) are module homomor- 
phisms. For <£> this entails verifying that the diagram 

U t ► U t+r] 

Vt+s > V t+v+ s 

commutes for all t and for all rj > 0. Because of the special form of the vector spaces and 
maps, it is enough to show that the situation is not one of the following: 

• > • • > o 



\ \ or \ \ 
o > • • > • 

Here a filled circle • indicates that the vector space is k, and an open circle o denotes that 
it is zero. For the first situation to occur, one must have 

p* < t and t + 5 < r* 

but 5 > r — p so this is impossible. For the second situation to occur, one must have 

q* <t + rj and t + 77 + 5 < s* 

but 5 > s — q so this too is impossible. It follows that $ is a module homomorphism. By 
symmetry, so is \&. 

The second step is to show that = 1^ and = ly. For the first of these, we must 
verify that the diagram 

U t ► Ut+25 



V t+S 

commutes for all t. This time the only forbidden configuration is: 

• ► • 

\ / 

o 

If this occurs then the top row implies: 

p* < t and t + 25 < q* 

Since 5 > r — p and 8 > q — s we infer that 

r* < (p + 5)* < t + 5 < (q - 5)* < s* 

which implies that the circle on the bottom row is filled after all. It follows that = l^f. 
By symmetry, = ly . 

This completes the proof when p, q, r, s are finite. The infinite cases are similar. □ 
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Remark. In the proof we have used and manipulated inequalities between decorated and 
undecorated real numbers. The definitions are the natural ones: 

p~ < p < p + < q~ < q < q + 

whenever p < q are undecorated real numbers. The proof can be written without this; for 
instance, the inequality p* < t can be replaced by the marginally weaker assertion p ^ t, 
and so on. 

4.3. The bottleneck distance (continued). If A, B are locally finite, it turns out that 
the 'inf is attained in the definition 

db(A, B) = inf (8 | there exists a 5-matching between A and B) , 

and can be replaced by 'min'. This will allow us to make a tighter statement of the stability 



theorem (4.11') for q-tame modules. See Theorem 4.20 



Theorem 4.10. Let A, B be locally finite multisets in the extended open half-plane t K° . 
Suppose for every rj > 5 there exists an rj-matching between A, B. Then there exists a 5- 
matching between A, B. 

The assertion is obvious if A, B are finite. The general case is proved using a compactness 
argument, as follows. 

Proof. As usual we treat A, B as sets rather than multisets. 

For every integer n ^ 1, let M n be a (5 + -)-matching between A, B. The plan is to construct 
a ^-matching M from the sequence (M n ). In practice, we work with the indicator functions 

X : Ax B^{0,1} 

X»:AxB-{0,l} 

of the partial matchings M, M n . 

The first step is to construct x as a limit of the sequence (xn)- Take a fixed enumeration 

{(ai,p t )\£>l) 

of the countable set A x B. We will inductively construct a descending sequence 

N = N 2 Ni 5 • ■ • 2 N< 5 . . . 

of infinite subsets of the natural numbers, with the property that Xn(c*e, fa) takes the same 
value for all n e N^. Having done so, we define x( a e, fa) to be this common value. 

The construction of is straightforward: once N^_x is defined, at least one of the two sets 

{n e N £ _i | Xn(®e, Pe) = 0} and {n e N £ _i | Xn((*£, fa) = 1} 
has infinite cardinality, and that will be our N^. (If both, then either will do.) Repeat. 
Lemma. If F is any finite subset of A x B, then there exists £ ^ 1 such that 

x(a,P) = Xn(a,/3) 

for all (a, /?) 6 F and for all n e N^. 
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Proof. Indeed, select £ such that (ai, pi), ■ ■ ■ , (on, pi) include all of F. 



□ 



The second step is to verify that x is the indicator function of a ^-matching. There are 
several items to check. 

• For as A there is at most one P e B such that x( a -,P) = 1- 

Proof. Suppose x( a > P) = x( a i P') = 1 f° r distinct elements (3, (3' e B. By the 
lemma, there exists n such that Xn( a i P) = Xn( a iP') = 1, which contradicts 
the fact that M n is a partial matching. □ 

• For a 6 A with d c0 (a, A) > 5, there is at least one (3 e B such that /5) = 1. 

Proof. Select such that d°°(a;, A) > 5 + i. Then the set 

F a = {/36B|d co (a,/3)<5 + i} 

is finite, since B is locally finite and these points lie in a square bounded away 
from the diagonal. By the lemma, there exists £ such that 

for all /3 e F a and for all n e N^. On the other hand, if n ^ AT, then M n 
matches a with some /3 e F a . Combining these observations, 

= Xn(a,P) = 1 

for sufficiently large neN( and for some /3 e F„. □ 
By symmetry we have: 

• For /3 e B there is at most one aeA such that x{ a i P) = 1- 

• For /3 e B with d°°(/3, A) > 5, there is at least one aeA such that x( a i P) = 1- 
Finally: 

• If P) = 1 then d°°(a, 0) < 5. 

Proof. By the lemma, there are infinitely many n for which Xn( a iP) = 1- 
Then 

d°>,/3)^+i 

for these n. Since n may be arbitrarily large, the result follows. □ 

These five bullet points confirm that M, defined by its indicator function x, is a 5-matching 
between A, B. □ 



Remark. Although we have chosen to give a direct argument, we point out that Theorem 4.10 



is an instance of the compactness theorem in first-order logic. The set of constraints that 
must be satisfied by an ^-matching can be formulated as a theory T v on a collection of 
binary- valued variables x a p. An ^-matching is precisely a model for that theory. The theory 
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Ts is seen to be logically equivalent to the union of the theories (T v \ rj > S). If each T v has 
a model, then any finite subtheory of this union has a model, therefore by compactness Tg 
has a model. The details are left as an exercise for the interested reader. 



4.4. The isometry theorem. Having defined the interleaving distance and the bottleneck 
distance, we can now state the main theorem. 

Theorem 4.11. Let U, V be q-tame persistence modules. Then 

di(U,V) =d b (dgm(U),dgm(V)) 

(Recall that dgm denotes the undecorated persistence diagram.) 

The result naturally falls into two parts: the 'stability theorem' [TOj |6] 

di(U,V) ^d b (dgm(U),dgm(V)), 



(4.11 



and the 'converse stability theorem' [16] 

(pljQ di(U,V) < d b (dgm(U),dgm(V)). 



The proof of the converse stability theorem (4.11') occupies section 4.5 We have already 
seen the result for decomposable modules, in Theorem |4.9 so it is a matter of extending the 
result to q-tame modules that are not known to be decomposable. 



The proof of the stability theorem (4.11 ) is given in sections 4.6 and 4.7 



4.5. The converse stability theorem. In this section we deduce the converse stability 



inequality (4.11') for q-tame modules from Theorem 4.9 A similar argument is given by 
Lesnick 



It may turn out that q-tame modules are decomposable, in which case this 
extra work becomes unnecessary. 

The key idea is that persistence modules can be approximated by better-behaved persistence 
modules, using a procedure we call 'smoothing'. 

Definition 4.12. Let V be a persistence module, and let e > 0. The e-smoothing of V is 

the persistence module V e defined to be the image of the following map: 

l 2 Y e : V[t - e] - V[t + e] 
In other words, V t e is the image of the map 

v\tl : V t - e -> V t+e , 

and, for s < t, the shift map (v e )l is the restriction of v 



t+e 



Thus we have a factorisation of ly 

(4.13) V[t - e] ► Y e > V[t + e] 

where the first map is surjective and the second map is injective (at all t). At a given index 
t this is the sequence: 

v t - € v: v t+e 
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Proposition 4.14. LetV be a persistence module. Then d;(V, V e ) < e. 



Proof. Indeed, the maps in (4.13) give an e-interleaving. □ 



Smoothing changes the persistence diagram in a predictable way. Here is the atomic example 
(which the reader can easily verify): 



Example 4.15. Let V = l[p*,q*). Then: 
V e = 



I((p + e)*,(q-e)*) if (p + e)* < (q - e) 
otherwise 



In other words, e-smoothing shrinks the interval by e at both ends. 

Proposition 4.16. The persistence diagram ofY € is obtained from the persistence diagram 
ofY by applying the translation T e : (p, q) i— > (p+e, q — e) to the part of the extended half-plane 
that lies above the line A t = {{t — e, t + e) \ te R}. 

In the case where Dgm is not everywhere defined, the proposition is understood to include 
the assertion that the finite r-interior of the persistence measure, and hence the region where 
Dgm is defined, is shifted by T £ . 

Information on Dgm(V) that lies below the line A e is lost in Dgm(V e ). 

Proof. We consider three different cases. Case (ii) is subsumed by case (iii), but the proof is 
easier to digest. 

(i) V is decomposable. The image of a direct sum of maps is the direct sum of the images 
of the maps; therefore e-smoothing commutes with direct sums: 

£eL leL 



By Example 4.15, the proposition is true for interval modules. It is therefore true for direct 



sums of interval modules. 

(ii) V is q-tame. It is enough to show that the rank function of Y e is equal to the rank 
function of V shifted by T e , since this determines the persistence measure and hence the 
persistence diagram. Specifically, for all b < c we require: 

rank[V£ - V c e ] = rank[H_ e -> V c+e ] 

In fact, these maps are related by the sequence 

H-e ► v b e ► v: ► v c+e 

where the first map is surjective and the third map is injective. Since the rank of a linear map 
is unchanged by pre-composing with a surjective map, or post-composing with an injective 
map, it follows that the rank of the middle map is equal to the rank of the composite. This 
is what we wished to prove. 
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(iii) general case. We show that the persistence measure of V e is equal to the persistence 
measure of V shifted by T e . Writing 

A = a — e, B = b — e, C = c + e, D = d + e 

this means showing that 

< °4 — »B — *C — °D | V> = < O a — • b — » c — O d | V> 

for all a < b < c < d. 

The proof is based on the following commutative diagram 

v: ► v b e — > K e ^ ^ 



Vk > V B > V c ► V D 

in which the maps / are surjective and the maps \ are injective. We will construct various 
quiver diagrams from this. Surjectivity means that 

<°4— •a) = and <o B — »6> = 0, 

and injectivity means that 

<« c — °c> = and < «d — o D > = 0. 

Moreover, by the restriction principle, interval types containing any of these 'forbidden' 
configurations occur with multiplicity zero. 



Then 



<°4- -•& — •<:- ~°D> = < °A — •B— *b — »c — *C — °d) 

+ three other terms 

= < °4 — *B — *6 — *c — *C — °D > 

= < °A — «S — •C — °D | V> 



and at the same time 



+ three other terms 



c 

»6 *c °d 



= < o„ — » h — •„ — o 



so we get the required equality. The six 'other terms' are all zero because they contain 
forbidden configurations. □ 

Corollary 4.17. LetY be a q-tame persistence module. Then d b (dgm(V), dgm(V e )) ^ e. 
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Proof. Indeed, an e-matching is denned as follows: 

(p,q) e dgm(V e ) <- (p - e, q + e) e dgm(V) 

This is bijective except for the unmatched points of dgm(V), which lie on or below the line 
A e , and therefore have distance at most e from the diagonal. □ 



Our claim that smoothing makes a persistence module 'better-behaved' amounts to the 
following fact. 

Proposition 4.18. LetY be a q-tame persistence module overH. Then Y e is locally finite. 
In particular, V e is decomposable into intervals. 



Proof. We make use of the characterisation in Proposition 2.21. Note first that 

dimfV/) = rank[Vt_ e —> V t+€ ] < go 

so condition (i) is satisfied. 

For condition (ii), we must show that there is a locally finite set S c R of 'singular values' 
with the property that V e is constant over each interval of the open set R — S. 

The set S is easily described: consider all points (p*,q*) e Dgm(V) such that p* + 2e < q*. 
Define S to be the union of the sets {p + e, q — e} over all such points of Dgm(V). 

In other words, we take all points of Dgm(V) which lie orj^Jor above the line 

A e = {(t-e,t + e) \ts~R} 

and then project each point both vertically and horizontally onto A e , abandoning the deco- 
ration. The resulting subset of A t defines ScR, using the identification (t — e, t + e) «-»•£. 
See Figure [To| (left). 

We show that S is locally finite. Indeed, for any t and for any rj < e, 

card(5 n (t — t], t + 77)) < 2^ v ([— 00, t + n — e] x [t — 77 + e, +00]) 

= 2 rank[Vt +r) _ e V t - v+e ] 

which is finite. 

We show that V is constant over each component of R — S. 

Claim. The map V£ — ► is an isomorphism if and only if Dgm(V) does not meet the union 
of the rectangles 

[—00, b — e] x [b + e, c + e] and [b — e, c — e] x [c + e, +00]. 



See Figure 10 (right). 

We split the claim into two statements. 



5 Points on the line must be decorated ((t — e) , (t + e) + ). 
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Figure 10. Constructing the set S from the persistence diagram (left). The 
exclusion zone for V£ — * V* to be an isomorphism (right) . 

Part 1. The map V b e — > V* is injective if and only if 

/^ v ([-oo, b - e] x [b + e, c + e]) = 0. 

Part 2. The map V b e — > V* is surjective if and only if 

Hy([b — e, c — e] x [c + e, +oo]) = 0. 

We approach this in the usual way. Consider the following diagram of vector spaces: 

v> ► v: 




V b - e ► V c „ e V b+£ > V c+e 

where the maps / are surjective and the maps \ are injective. Using the abbreviations 

6_ = b — e, b + = b + e, c_ = c — e, c + = c + e 
we can write down the 'forbidden' configurations 

Ofe_ •& •& °c_ *c *c °c+ 

which occur with multiplicity zero. 

For Part 1, note that V b e — » is injective if and only if 

<. b -o c |V e > = 0. 
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Taking the forbidden configurations into account, we calculate: 

< •& — o c | V e > = <— - m b — o c - — > 

= (— ~ •& °c °c+> 

= <- "*6 -° C+ > 

= (*6_ — •& — *6+ — °c+> 
= <•&-- -*&+ — °c+> 

which is exactly /^v([— oo, b — e] x [6 + e, c + e]). 

For Part 2, note that V£ — > is surjective if and only if 

< °b « c I V e > = 0. 

We calculate: 

<O fo — . c |V e > = <- -Q 6 — . c - -> 

= °6 •c- —) 

= (°b- *c > 

= »c_ *c •<:+> 

= <°f>_— *c_- -*c+> 

which is exactly /Xy([6 — e, c — e] x [c + e, +oo]). 
This completes the proof of the claim. 

The claim implies that V is constant on each connected component of R — S. Indeed, if 
S n [b, c] = then the exclusion zone does not contain any points of Dgm(V), and therefore 
(v e )l is an isomorphism. 



Thus V e satisfies both conditions of Proposition 2.21 and is therefore locally finite. □ 



We are now ready to prove the converse stability theorem for q-tame persistence modules, 
using the triangle inequalities for dj, db and our results on e-smoothing. 



Proof of (4-11' )■ Let U, V be q-tame persistence modules. For any e > 0, the e-smoothings 



U e , V e are decomposable, so the converse stability theorem applies to them. Then: 



di(U, V) s; di(U e , V e ) + 2e by Proposition 4.14 



s; d b (dgm(U e ),dgm(V e )) + 2e by Theorem [4T9 
< d b (dgm(U), dgm(V)) + 4e by Corollary [iTf 

Since this is true for all e > 0, we deduce that 

di(U,V) ^d b (dgm(U),dgm(V)). 

The converse stability theorem for q-tame modules is proved. □ 



We finish this section with a characterisation of q-tame modules. 
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Theorem 4.19. A persistence module V is q-tame if and only if it can be approximated, in 
the interleaving distance, by locally finite modules. 



Proof. If V is q-tame then it is approximated by the modules V e , which are locally finite by 



Proposition |4.18| Conversely, suppose V is approximated by locally finite modules. Suppose 
b < c is given. Let W be a locally finite module which is e-interleaved with V, for some 
e < (c- b)/2. Then 



i c b = rank[H -> V c ] 
= rank[T4 W b+e 

< dim(W b+e ) 

< 00. 



It follows that V is q-tame. 



Vc] 



□ 



There certainly exist q-tame modules which are not locally finite, such as: 

oo 
n=l 

As stated before, we do not know whether all q-tame modules can be decomposed into 
intervals. 



4.6. The stability theorem. The inequality (4.11 ') can be expressed in the following form: 



Theorem 4.20. Let U,V be q-tame persistence modules which are 5 + -interleaved. Then 
there exists a S-matching between the multisets dgm(U), dgm(V). 

It is easier to prove the following. (Notice the missing + .) 

Theorem 4.21. Let U, V be q-tame persistence modules which are 5 -interleaved. Then there 
exists a 5 -matching between the multisets dgm(U) ; dgm(V). 



Theorem 4.10 allows us to deduce Theorem 4.20 from Theorem 4.21: if U, V are 5 + - 



interleaved then there is an ^-matching between their diagrams for every r\ > 6, hence 
there is a 5-matching. 



The proof of Theorem 4.21 depends on two main ingredients: 



The interpolation lemma (3.4), which embeds U, V within a 1-parameter family. 



The box lemma (4.22), which relates the persistence measures of U, V locally. 



Once these ingredients are in place, the theorem can be proved using the continuity method 
of [10] . Our persistence diagrams may have infinite cardinality, so we will need an additional 
compactness argument to finish off the proof. 

Let R = [a, b] x [c, d] be a rectangle in R 2 . The 5-thickening of R is the rectangle 

R s = [ a -6,b + 6] x [c-8,d + S\. 
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For convenience we will write 

A = a - 5, B = b + 5, C = c-6, D = d + 5 
in this situation. For infinite rectangles, note that — oo — 5 = — oo and +oo + 5 = +oo. 

We can also thicken an individual point: if a = (p, q) then 

a 5 = [p — 5, p + 5] x [q — 5, q + 5] 

for 5 > 0. 

Lemma 4.22 (Box lemma [10J). Let U, V be a 5 -interleaved pair of persistence modules. Let 
R be a rectangle whose S-thickening R s lies above the diagonal. Then Hn(R) < fxy(R s ) and 
Hv(R) < fi V (R 5 ). 



If we use the extension convention (section [2.5 ) we can state the lemma without the re- 



quirement that R lies above the diagonal, because the convention gives fi(R ) = oo if it 
doesn't. 

Proof. Write R = [a, b] x [c, d\ and R 5 = [A, B] x [C, D] as above. Thanks to the interleaving, 
the finite modules 

Ua,b,c,d ■ U a ^U b ^U c ^U d 

and 

Va,b,c,d : V A ^V B ^Vc^V D 
are restrictions of the following 8-term module 

W : V A -^U a ^U b -^V B ^V c -^U c ^U d -^V D 
where $, \I/ are the interleaving maps. 

Using the restriction principle, we calculate: 

fi Y ([A,B] x [C,D])=(o A - -* B -'c- -o D | V> 

= <°A- -*B—*C- °0 | W) 

= < °A— °a— *B— •C — •c— °d— °£> I W> 

+ eight other terms 
< 0^4— o a — — « B — « c — « c — o d — o D I W> 
= <- -°a— «6— -»c— - I W> 

= (- — °a -*c °<i— - I U> 

= ^u([a,6] x [M]) 

This proves fiv(R) ^ fiy(R s ). The inequality /jly(R) < fiu(R s ) follows by symmetry. □ 



Recall the measures at infinity defined in section 2J3 By considering the appropriate limits, 
we immediately have: 

64 



Proposition 4.23 (Box inequalities at infinity). Let fi, v be r-measures on R 2 which satisfy 
a one-sided box inequality with parameter S 

fjt(R) «S v(R s ) 

for all rectangles R e Rect(R 2 ). Then 

/i([a, b], -oo) < u{[A, B], -co), (i(-co, [c, d]) < v(-cc, [C, D]), 

//([a, 6], +oo) < v([A, B], +oo), //(+oo, [c, d]) < i/(+oo, [C, D]), 

for all a < b and c < d; and 

jtt(— oo, — oo ) < ^(—oo, — oo), A t (+°o, — oo) ^ ^(+oo, — go), 

//( — 00, +00) < — OO, +00), /W( + 00, +00') < I/(+OQ, +00'). 

Here A = a-5, B = b + 5, C = c-5, D = d + 5. □ 

Consequently, if U,V are ^-interleaved persistence modules then /^u, A*v satisfy (two-sided) 
box inequalities on (— oo, R) and (R, +oo) as well as the equality (Jtu(— oo, +oo) = // v (— oo, +oo). 



4.7. The measure stability theorem. We now embed Theorem 4.21 as a special case of 
a stability theorem for the diagrams of abstract r-measures. The more general statement is 
no more difficult^] to prove, and seems to be the natural home for the result. 

Let T) be an open subset of R 2 . For a e D, define the exit distance of a to be 

ex°°(a,D) = d°°(a,R 2 -2)) = min (d°°(a,x) | x e R 2 - D) . 

For instance, for the extended half-plane we have ex 00 (a, "K) = d°°(a, A). 

Let A, B be multisets in D. A 5-matching between A, B is a partial matching M c A x B 
such that 

d°°(a;, /3) < S if a, /3 are matched, 
ex 00 (a, D) < S if a e A is unmatched, 
ex co (/3, D) < S if /3 e B is unmatched. 

If D is not clear from the context, we refer to M as a '^-matching between (A, D) and (B, D)'. 



With the same proof as Proposition 4.8, we have: 



Proposition 4.24 (triangle inequality). //A, B, C are multisets in D and there exists a Si- 
matching between (A, T>), (B,D) and a ^-matching between (B, D), (C,D) ; then there exists 
a (Si + S^-matching between (A,D), (C, D). □ 

Now for the main theorem. 



In fact it's a little easier to prove, because the compactness argument for diagrams with infinitely many 
points can be conducted more cleanly in this generality. 
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Figure 11. The two diagrams (5 dark blue squares; 5 light pink circles) have 
box distance 1 and bottleneck distance 3. Generalising this example, one can 
exhibit a pair of diagrams with 4k + 1 points each, which have box distance 1 
and bottleneck distance 2k + 1. 

Theorem 4.25 (stability for finite measures). Suppose (fi x \ x e [0,<5]) is a 1-parameter 
family of finite r-measures on an open set D c R 2 . Suppose for all x,y e [0,5] the box 
inequality 

holds for all rectangles R whose \y — x\-thickening R\ y ~ x \ belongs to Rect(D). Then there 
exists a 5-matching between the undecorated diagrams (dgm(/io), T>) and (dgm^), D). 



In view of the Interpolation Lemma (3.4), this imp lies Theorem 4.21 (take fi x = fi(JJ x ) and 



D = J-C°) and therefore the stability theorem (4.11') for q-tame modules. 



Example 4.26. The existence of a 1-parameter family interpolating between /x and n$ may 
seem unnecessarily strong. It is natural to hope that two measures /i, v which satisfy the 
(two-sided) box inequality with parameter 5 will have diagrams dgm(/i), dgm(u) which are 
5-matched. This is simply not true, and in fact there is no universal bound on the bottleneck 



distance between the two diagrams. See Figure 11 



Our goal for the rest of this section is to prove Theorem 4.25 Parts 1 and 2 closely follow 
the method of Cohen-Steiner, Edelsbrunner and Harer [TP] . 



Afterwards, in section 4.8, we generalise the theorem to r-measures that are not finite. 



Initial remark. Because the metric d 00 separates R 2 into nine strata (the standard plane, 
the four lines at infinity, and the four points at infinity), we seek separate 5-matchings for 
each stratum that meets D. We begin with the points in the standard plane. 

Temporary hypothesis. Suppose initially that D c: R 2 . 

Part 1. The H aus dor ff distance between (dgm^), D), and (dgm(/^ y ), D) is at most \y — x\. 
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Write A = dgm(p, x ), B = dgm(/i y ), and 77 = \y — x\. The assertion is understood to mean: 

• If a e A and ex 00 (a, 2)) > 77, then there exists (5 e B with d co (o;, /3) < 77. 

• If p e B and ex c0 (/3, D) > 77, then there exists aeA with d c0 (a, /3) «S 77. 

Proof. By symmetry, it is enough to prove the first statement. 

Given such an a, let e > be small enough that 77 + e < ex 00 (a, D). Then the box inequality 
gives 

1 < vM) < ^(«" +e ) 

so there is at least one point of B in the square a v+e . This is true for all sufficiently small 
e > 0, and moreover B is locally finite. Therefore there is at least one point of B in a v . □ 

Henceforth, we will write A x = dgm (p x ) for all x. 

Part 2. The theorem is true if A x has finite cardinality for all x. 

Proof, (i) The triangle inequality for matchings includes the implication 

An, A x are x- matched ) . . , , , 

. ' , x , , , ?• => An, A„ are y-matched 

A x , A y are (y - zj-matched J u ' v y 

whenever < x < y. 

(ii) We claim that for every x e [0, 5] there exists p(x) > such that A x , A y are \y— x|-matched 
whenever y e [0, S] with \y — x\ < p{x). 

Suppose ai,..., ctfc is an enumeration of the distinct points of A x , with respective multiplic- 
ities Hi, . . . , nk- Let p(x) be chosen to satisfy the following finite set of constraints: 



< p(x) «S 



|ex°°(ai,2)) alH 

|d°°(o;j, a,-) all i,j distinct 



We must show that if \y — x\ < p(x) then A^, A y are \y — x|-matched. 
Write 77 = \y — x\ and let 

(R 2 -D)" = {aeD| ex 00 (a, D) ^ 77} . 

It follows from Part 1 that A y is contained entirely in the closed set 

(R 2 - V) r > u a\ u • • • u al 

and it follows from the definition of p(x) that the terms in the union are disjoint. It is easy 
to count the points of A y in each square a*-. Let e > be small enough that 2i] + e < 2p(x). 
Then the box inequality gives 

Tii = p, x {al) ^ p y {a^ +e ) ^ fj >x (aii n+e ) = n { . 

Thus p y (oil +e ) = Hi for all small e > 0. We conclude that the square ci} contains precisely 
Hi points of A y . 
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This completes the proof of (ii), because we can match the n» copies of a* with the rii points 
of A y in the square a 7 }, for each i, to define an ^-matching between (A X ,T>), (A y ,D). All 
points of A;,, are matched, and the only unmatched points of A y lie in R 2 — D and do not 
need to be matched. 



Items (i) and (ii) formally imply that A ,A$ are 5-matched, using the standard Heine-Borel 
argument. Indeed, let 

m = sup(x e [0,5] | A and are x-matched). 

First, m is positive; specifically m ^ p(0). Applying (i) to < ml < m, where A ,A m / are 
m'-matched and m — m' < p(m), we deduce that A , A m are m-matched. Suppose m < 5. 
Applying (i) to < m < m", where m" — m < p(m), we deduce that A , A m » are m"-matched. 
This contradicts the definition of m. Therefore m = S, and A , A^ are <5-matched. □ 



Part 3. The theorem is true without assuming finite cardinality. 



Proof. Let (D n ) be an increasing sequence of open subsets of D whose union equals D and 
such that each D n has compact closure. Because A x is locally finite, it follows that A x n D n 
is finite for all x, n. We can therefore restrict the family of measures to each D n in turn, and 
apply Part 2 to get a 5-matching M„ between (Ao n D n , D n ) and (As n D n , D n ). 

We now take a limit M of the partial matchings M n , using the construction in the proof of 



Theorem 4.10[ (This works because A 0} A$ are locally finite and therefore countable.) Let 



X, Xn denote the indicator functions of M, M„. As before, we have the Lemma that for any 
finite subset F a A x A^, there are infinitely many n e N for which 

X{<*,P) = Xn(a,/3) 

for all (a, (5) e F. 

We must show that M is a 5-matching between (A , D) and (Ag, D). It is immediate that 
each matched pair is separated by at most 5, since this is true for every M n . The argument 
that each a is matched with at most one /3, and vice versa, is the same as before. 

Finally, suppose a e A with ex 00 (a, D) > 8. The square a s is contained in D and is compact, 
and therefore is contained in D n for sufficiently large n. This means that ex 00 (a, T> n ) > 5 and 
hence a is matched in M n for sufficiently large n. Now a has only finitely many 5-neighbours 
Pi, . . . , flk in the locally finite set A,5, so by the Lemma there are infinitely many n such that 
x(a, (3i) = Xn(ot, A) f° r all i- By taking a sufficiently large such n, we conclude that 

X(a, A) = Xn(a, A) = 1 

for some i. Thus a is matched. 

By symmetry, any (3 e As with ex°°(/3, A) > 5 is matched in M to some a e Aq. 



It follows that M is the required 5-matching. 
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□ 



The theorem at infinity. Now suppose 2) c R 2 meets any of the strata at infinity. For each 
of the four lines at infinity, the 3-part proof given above works almost verbatim, if we replace 
T> with its intersection with the chosen line, and each r-measure \i x with the corresponding 
measure at infinity. The other change is to replace the word 'square' with the word 'interval'. 



The necessary box inequality at infinity is found in Proposition 4.23 



For the four corners (+00, +00), it is easier still: the box inequality at each corner implies 
that us have the same multiplicity there. The interpolating measures are not needed. □ 

This completes the proof of the stability theorem for finite measures on an open domain D, 
and hence the stability theorem for q-tame persistence modules, and hence the isometry 
theorem for q-tame persistence modules. 



4.8. The measure stability theorem (continued). The stability theorem generalises to 
measures that are not necessarily finite. By the extension convention, we may suppose that 
the measures are defined on R 2 (rather than just a subset of R 2 ). Given a 1-parameter 
family (n x \ x e [0, 8]), the finite interiors 

now depend on x; whereas previously we had 3 X = T> for all x. 

For J a R 2 an open set and 5^0, the 'reverse offset' is the open set 

J- 5 = {a e 3 I ex°°(a, Tj > 8} = {a e J \ a s a 3} . 

Intuitively, this shrinks J by 8 at the boundary. Clearly J 3 S implies 3 r ~ 5 3 9~ 5 , and 
^Siys 2 = gr-(<5 1+ <5 2 )_ Note algo that (j n 2j-s = 3-8 n g-5 This is easiest to see from the 

second characterisation. 

Remark. The operation [-] _<5 has no effect on the corners at infinity, and acts independently 
on the standard plane and on the four lines at infinity. 

We define 5-matchings for multisets in unequal domains. Let 5F, 9 be open subsets of R 2 , let 
A, B be multisets in £F, 9 respectively, and let 8 > 0. A 5-matching between (A, 3 r ), (B, 9) 
is a partial matching M between A, B such that the following four conditions hold: 

• 5 3 g- 5 and 9 2 -J- s , 

• if {a, 13) e M then d°°(a,/3) < 8, 

• every aeAn S s is matched with some (3 e B, 

• every (3 e B n 3 r ~ s is matched with some a £ A. 

The first of these is a compatibility condition between the domains: they cannot be too 
unequal. This is automatic if 3 = 9, which is why we haven't seen it before. Notice the 
cross-over in the last two conditions: a point in A is allowed to be unmatched only if it is 
close to the boundary of B's domain 9, and vice versa. 

Proposition 4.27 (triangle inequality). //A, B, C are multisets in 3, 9, IK respectively, and 
there exist a 8\-matching between (A, 3 r ), (B, 9) and a 82-matching between (B,S), (C,"K), 
then there exists a (81 + 82)-matching between (A, 3 r ), (C, C K). 
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Proof. As usual, compose the two partial matchings to get a partial matching M between 
A, C. Writing S = 5i + 5 2 , we must check that this is a <5-matching between (A, jF), (C, %). 
For first condition we see that 

J 3 S~ 51 3 (K- S2 )- Sl = 'K- 5 and "K 2 S^ 2 3 (3^*)-* = J - * 5 . 

The second condition follows from the triangle inequality for d 00 . For the third condition, if 
a e A lies in < K~ 5 then by the inclusion above it lies in S~ Sl . Therefore a is matched with 
e B. Moreover /3 must then lie in ft-^- 5 ^ = IK - * 2 and so is matched with 7 e C. The 
fourth condition follows by symmetry. □ 

Remark. There is no triangle inequality if the condition on the domains is dropped. 

Here is the main theorem of this section and the final theorem of the paper. Again we use 
the abbreviation J x = 3^°{ix x ) for the finite interiors. 

Theorem 4.28 (stability for measures). Suppose (fi x \ x e [0,5]) is a 1 -parameter family of 
r-measures on R 2 . Suppose for all x, y e [0,5] the box inequality 

holds for all rectangles R s Rect(R 2 ) . Then there exists a 5-matching between the undecorated 
diagrams (dgm(/x ), 3^) o,nd (dgm (//<s) , J's) . 

The first condition for a 5-matching, on the domains S^, 3\5, can be checked easily: 



Proposition 4.29. Under the hypotheses of Theorem 4-28 , we have inclusions 
for all x, y e [0, 5] . 

Proof. Suppose aeSy'" x \ then equivalently a} y ~ x \ a 3 y . Since the square a^ y ~ x ^ is compact 
and 3 y is open, there exists e > such that w y ~ x \ +e a 3^. The box inequality gives 



and the right-hand side is finite by Proposition 2.16, Thus a e 3^°(ji x ) = J x . □ 



Proof of Theorem \4-2S\ The argument closely follows the proof of the stability theorem for 
finite measure, so we will confine ourselves to indicating the necessary modifications. We 
use the abbreviation = dgm(/i a; ). 

Initial remark. Recall that the proof is conducted separately for each of the nine strata. 
The four corners at infinity are handled easily (each corner belongs to both ^Fq and 3^, or 
to neither; in the former case the [j,q, fi$ multiplicities agree). The proof is described for the 
points in the standard plane. The same proof applies to each of the four lines at infinity, 
replacing each /j, x with the corresponding measure at infinity. 

Part 1. The Hausdorff distance between (A x , 3 X ), and (A y , 3 y ) is at most 77 = \y — x\. 



The assertion is understood to mean: 
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• If a 6 A x and ex 00 (a, 3^) > 77, then there exists (3 e A y with d°°(a, 0) < 77. 

• If /? e Ay and ex°°(/3, 3^) > 77, then there exists aE Aj with d°°(a, /?) < 77. 

Proof: for all e > with 77 + e < ex°°(a:, 3^), we have 1 < /^(a; 6 ) < fi y (a v+e ) so there is at 
least one point of A y in aP. 

Part 2. The theorem is true if A x has finite cardinality for all x. 



Item (i) is given by the triangle inequality (Proposition 4.27). 



Item (ii) uses the same strategy as before. Let (oci) be a finite enumeration of the distinct 
points of A x , with respective multiplicities (rij). Then p(x) is chosen to satisfy: 

n 1 \ ^ ] iex 00 (a i ,3 r x .) all i 
< p(x) < < f 

I |d°°(Q;i, aj) all i,j distinct 

If = |y — x| < p(x), then Part 1 implies that A y is contained in the disjoint union 

(R 2 - 3 x y u ol[ u • • • u a£ 

The box inequality is then used to count precisely 7ij points of A y in the square This 
defines a partial matching where all points of are matched and all points of A y n 3^~ v are 
matched. 

The formal deduction of Part 2 from (i) and (ii) is unchanged, since it is a formal deduction. 
Part 3. The theorem is true without assuming finite cardinality. 

The idea is to restrict each measure p, x to a relatively compact open subset 3\ a 3 r x = 3°(p x ). 
The subsets satisfy the compatibility condition 

for all x,y s [0, 5]. 

Specifically, for e > and r > 6, let 

3\ = 3^ n Q r 

where Q r = (— r, r) x (— r, r) is the open d°°-disk of radius r. Define a function on rectangles 
as follows: 



Ar(-R) 



fi x (R) ifR<=% 
00 otherwise 



It is easy to check that fi x is an r-measure (additivity still holds), that jF ^) = 3^, and 
that dgm(p x ) = dgm^) n %. 

Lemma. The family (fi x ) satisfies the box inequality £t x (R) < jj, y (R} y ~ x \) for all x,y e [0, 5]. 
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Proof. Since the box inequality is assumed to hold for (fi x ), it will automatically hold for 
(fix)', except possibly for rectangles R where the left-hand side of the inequality has become 
infinite while the right-hand side hasn't. This happens when R cj: 3 X while R) y ~ x \ a J y , and 
we can prevent it by ensuring that g\ 3 J y \ y x \. And, indeed, 

jr-\y-x\ = /gr-e n Q r)-|y-x| = j-{e+\ V -x\) n Q r-\y-x\ £ n Q r = $ x . □ 

Since 9^ has compact closure in 9\ r , and A x is locally finite, it follows that = dgm(fi x ) = 
A x n 9\ has finite cardinality We can therefore apply Part 2 to the family (ft x ) to get a 
5-matching between (Ao,^) and (A$, 9^). This can be interpreted as a partial (^-matching 
between Aq,A$ where: 

• a e A is matched whenever a 6 (3"g~ e n a/') -5 = 9^ ^ 5+e ' ) n u/ r ~ 5 

• /3 e A^ is matched whenever /3 e (9^ e n w r ) _<5 = 9 r ^ <5+e ' ) n a/ -5 

Repeat this argument for a sequence (e n ,r n ) where e n — > and r n — >• +00. This gives a 
sequence of 5-matchings M„, and we can form a limit M as before. 

If a e A n 9^ then eventually a e 9 r 5 ( <5+£n ) n w rn ~ <5 and so a is matched by M n for all 
sufficiently large n. The same is true for (3 e A5 n 9 r ~ <5 . With this information, we can 
complete the usual proof that M is a (^-matching between (A , 9 r o) and (As, 9^). □ 



This concludes the proof of Theorem 4.28| Here is a sample consequence. 

Example 4.30 (Stability of the Webb module). Let V be a persistence module which is 
(^-interleaved with the module W of Example 2.20 By interpolation and the box lemma, the 
measure stability theorem applies here. We get a ^-matching between (dgm(/iv), 9 r °(/xv)) 
and (dgm(/i W ), 9 r °(/iw))- This amounts to the following. 

• In the finite part 9£ of the half-plane: 

The singular support of /iy is contained in the diagonal strip ApA. Each point of 
dgm(/i V ) outside this strip is matched with some point (— n, 0) e dgm( / u w ). Conversely, 
the only unmatched points of dgm(/xw) must lie within distance 5 of the diagonal or of 
the singular support of /iy- 

In particular, if 5 < | then all points of dgm(/xw) are matched. 

• On the line (— 00, R): All points and singularities of /xy are contained in the interval 
(—00, [—5, +6]). There is at least one singular point. 

• On (R, +00) and at (—00, +00): The measure fiy has no points or singularities. 



5. Examples 



5.1. Partial inter leavings. In some practical data analysis situations, one considers per- 
sistence modules which are only partially interleaved. One such scenario is presented by 
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t -S 



t a + S 



Figure 12. Left: the partial matching of Theorem 5.1 between dgm(U) (•) 
and dgm(V) (o). Right: projection from dgm(U) (• and •) to dgm(U) (o). The 
grey points are the ones that disappear. 

Chazal et al. in the context of clustering by mode-seeking [9J. A filtered simplicial complex 
on an input point cloud is compared with the sublevelset filtration of the density function it 
was sampled from. In low-density regions, the sample is too sparse to expect there to be an 
interleaving. Nevertheless, there is interleaving when the density is sufficiently high. 

This leads to the following notion of partial interleaving, adapted from [9] . Two persistence 
modules U and V are said to be ^-interleaved up to time t if there are maps (f>t : U t — > V t+ s 



and ij) t : V t —> U t+ g defined for all t < t , such that the diagrams of equation 3.1 commute 
for all values s < t < to! that is, for all values where the maps are defined. 



A weaker version of the stability theorem, illustrated in Figure 12 (left), can be proven: 



Theorem 5.1 (from [9]). Let U and V be two q-tame persistence modules that are 5- 
interleaved up to time to- Then, there is a partial matching M c dgm(U) x dgm(V) with 
the following properties: 

• Points (p,q) in either diagram for which — q\ ^5 are not required to be matched. 

• Points (p, q) in either diagram for which p ^ to — 5 are not required to be matched. 
All other points must be matched. Then: 

• If a, (3 are matched, then the p-coordinates of a,f3 differ by at most S. 

• If a, [3 are matched and one of a, (3 lies below the line q = to, then d°°(a,/3) ^ 5. 

For the proof, we introduce two new persistence modules U,V. 

U t = U t if t < t + 5 and U t = otherwise 
V t = V t if t < t + 5 and V t = otherwise 
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with maps 

tt* = it* if t < t + <5 and w* = otherwise 
t>* = u* if t ^ t + 5 and v l s = otherwise 

for all s < t. We may call U, V the truncations of U, V to (— oo, T], where T = t + 5. 



Proof. There are three steps. 

Step 1. The decorated diagram of a persistence module U determines the decorated diagram 
of its truncation U, in a straightforward way. Specifically, transform each point (p*,q*) £ 
Dgm(U) as follows: 

\p*,q*) ifg*<T 
(P*,Q*) \ {V*,T+) iip*<T<q* 
disappears if T < p* 

Then Dgm(U) is the result of this transformation. The consequent relationship between the 



undecorated diagrams is illustrated in Figure 12 (right). 

Step 2. If U, V are 5-interleaved up to time to, then U, V are 5-interleaved. 

Combining the first two steps we get the third. 

Step 3. The stability theorem gives a 5-matching between dgm(U), dgm(V). This lifts to a 
matching between dgm(U), dgm(V) which has the properties stated in the theorem. 

The second and third steps are straightforward. Only the first step requires any technical 
input, intuitively plausible as it may be. The framework developed in j6] leads to a 2-page 
argument, presented in the appendix of [9]. 

Here is a shorter proof. Write \i = /iu and ft = /x^- Let A denote the multiset obtained from 
Dgm(U) by applying the transformation ([§]). Consider an arbitrary rectangle [a, b] x [c, d] s 
Rect(!K). We easily see: 

' H([a,b] x [c,d]) tfd^T 
card(A| [aib]xM] ) = \ /i([a, b] x [c, +oo]) if c < T < d 

if T < c 

To show that we have correctly determined Dgm(U), it suffices to show that card(A|[ a> 6] x [ Ci rf]) = 
/i([a, b] x [c,d]) for all rectangles. And indeed: 

• If d < T, then: 

/i([a,6] x [c, d\) = {o a — •(,— « c — o d | U> 
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U> = /x([a,fo] x [ c ,d\) 



• If c < T < d, then: 

fi{[a,b] x [c,d]) = (o a —» b —» c —o d | U> 

= <°«- •b— •c— ~ I U> = //([o, 6] x [c, +oo]) 

since [7^ = 0. 

• If T < c, then: 

/2([a,6] x [c,d]) = <o a — . 6 — » c — o d | U> = 

since U c = 0. 

It follows that Dgm(U) = A as claimed. □ 

5.2. Extended persistence. Cohen-Steiner, Edelsbrunner and Harer [llj introduced ex- 
tended persistence to capture the homological information carried by a pair (X, /). Some 
but not all of this information is recovered by the sublevelset persistence H(X su b). The idea is 
to grow the space from the bottom up, through sublevelsets; and then to relativise the space 
from the top down, with superlevelsets. Extended persistence is the persistent homology of 
this sequence of spaces and pairs. 

It is usually assumed that (X, /) has finitely many homological critical points (oj). One 
applies a homology functor to the finite sequence^] 

x a ° _> x ai ► x^- 1 - X - (X,X an ) ► (X,X a2 ) -> (X,X ai ) 

to get a quiver representation. The indecomposable summands of this representation are 
interpreted as features, and are drawn as points in the 'extended persistence diagram'. There 
are three kinds of feature: 

• ordinary features (which are born and die before the central X); 

• relative features (which are born and die after the central X); 

• extended features (which are born before the X and die after it). 

The finiteness assumption is satisfied when (X, /) is a compact manifold with a Morse 
function, or a compact polyhedron with a piecewise-linear map. In the former situation, 
there are extra symmetries (Poincare, Lefschetz) which are explored in [TT] . 

In fact, it is perfectly straightforward to define the extended persistence diagram under a 
weaker hypothesis. Suppose X is a compact polyhedron and / is a continuous real-valued 
function on X. Then: 

• rank (H(X S ) — > H(X*)) < go whenever s < t; and 

• rank(H(X, X s ) — > H(X, X t )) < oo whenever s > t. 



We write X 1 = (X, /)* = / 1 (— oo, t] and X t = (X, f) t = f 1 [t, +oo) for sublevelsets and superlevelsets. 
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The first of these facts is Theorem 2.22| The second is proved similarly, by factorising the 
map through some H(X,Y), where Y is a subpolyhedron of X nested between X s ,X t . 

Define the ordered set 

fL = {t | t e R} ordered by s < t <=> s ^ t, 

thought of as a 'backwards' copy of the real line, with bars under numbers to remind us. For 
extended persistence we may work with the set 

R EP = R U { + 00} U SI 

with the ordering s < +00 < t for all s,t. 

The extended persistence module X EP = X EP for (X, /) is defined as follows: 

V t = H(X*) for t e R 
V +a3 = H(X) 

Vt = H(X,X t ) forteil 

Since R EP is order-isomorphic to the real line, we may interpret X EP it as a persistence 
module over R. The two facts cited above imply that it is q-tame, so the decorated diagram 
is defined away from the diagonal. 

Alternatively, we can define the extended persistence diagram in three pieces: 



AW ([a,b] 


X 


[c,d]) = (o a - 


•c— °d) for -00 < 


a 


< b < c < d < +oC' 


Hrel([a,b] 


X 


[C,d]) = (°gr 


-•b — *c — °d) for +00 < 


( A 


< k < c < d < —00 


^ext([a,6] 


X 


[c,d]) = (o a - 


•c— °d) for -00 < 


a 


< b < +00 and +00 < c < d < —00 


taking V- x 




and V-co = 


whenever needed. 







The measures // rd)A*rei are defined over the half-plane JC, whereas /i ex t is defined over R 2 . 

Stability for dgm ord , dgm rel and dgm ext is proved individually for each diagram. Given two 
functions /, g which are 5-close in the supremum norm, there are inclusions 

(X, /)* c (X, g) t+5 (X,f) t ^(X,g) t _ s 
(X,gY^(X,f) t+s (X,g) t ^(XJ) t _ s 

using which we can prove the box lemma for each measure. Since linear combinations of 
continuous functions are continuous, we can interpolate between / and g to satisfy the 
hypotheses required by the measure stability theorem. 



Remark. In the spirit of Theorem |2.23[ one may treat the case where X is a locally compact 
polyhedron and / is proper. We leave it as an exercise for the sufficiently persistent reader 
to carry this out and locate the potential singularities of the measures. 
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